Embedding Similarity Analysis¶
This notebook analyzes similarities between the embeddings of the ethical dilemma dataset in both Portuguese, German, Spanish, and French in three main ways:
- Scenario-wise analysis: Compare different actors' responses to the same scenario (ethical dilemma)
- Actor-wise analysis: Compare a same actor's responses to different scenarios
- Reason-wise analysis: Compare different reasoning versions for a same actor in the same scenario
The actors considered for this analysis are:
- LLM Models: GPT-3.5, GPT-4, Claude Haiku, Gemini 2, Gemma 7B, Mistral 7B, and Llama 2.
- Human Redditors: The author of the top comment of each scenario submission.
import pandas as pd
from embedding_utils import (
load_embeddings,
load_or_compute_similarities,
identify_actors_and_reasons,
plot_row_similarity_distribution,
summarize_row_characteristics,
save_analysis_results,
display_edge_llm_human_similarities,
analyze_column_similarities,
plot_column_similarity_comparison,
summarize_column_characteristics,
display_edge_scenario_similarities,
plot_reason_similarity_comparison,
summarize_reason_characteristics,
cross_analyze_actor_similarity,
)
embeddings_dict_base = load_embeddings("../data/embeddings.csv")
embeddings_dict_br = load_embeddings("../data/embeddings_br.csv")
embeddings_dict_de = load_embeddings("../data/embeddings_de.csv")
embeddings_dict_es = load_embeddings("../data/embeddings_es.csv")
embeddings_dict_fr = load_embeddings("../data/embeddings_fr.csv")
actors_base, reason_types_base = identify_actors_and_reasons(embeddings_dict_base)
actors_br, reason_types_br = identify_actors_and_reasons(embeddings_dict_br)
actors_de, reason_types_de = identify_actors_and_reasons(embeddings_dict_de)
actors_es, reason_types_es = identify_actors_and_reasons(embeddings_dict_es)
actors_fr, reason_types_fr = identify_actors_and_reasons(embeddings_dict_fr)
1. Scenario-wise Analysis¶
This analysis compares how human redditors and LLM models respond to the same ethical dilemma. For each scenario (row), embedding similarities are calculated between all pairs of actors.
row_similarities_base = load_or_compute_similarities(
"base", embeddings_dict_base, actors_base, reason_types_base, "row"
)
row_similarities_br = load_or_compute_similarities(
"br", embeddings_dict_br, actors_br, reason_types_br, "row"
)
row_similarities_de = load_or_compute_similarities(
"de", embeddings_dict_de, actors_de, reason_types_de, "row"
)
row_similarities_es = load_or_compute_similarities(
"es", embeddings_dict_es, actors_es, reason_types_es, "row"
)
row_similarities_fr = load_or_compute_similarities(
"fr", embeddings_dict_fr, actors_fr, reason_types_fr, "row"
)
plot_row_similarity_distribution(row_similarities_base, "Base")
plot_row_similarity_distribution(row_similarities_br, "Portuguese")
plot_row_similarity_distribution(row_similarities_de, "German")
plot_row_similarity_distribution(row_similarities_es, "Spanish")
plot_row_similarity_distribution(row_similarities_fr, "French")
row_summary_df_base = summarize_row_characteristics(row_similarities_base, "Base")
row_summary_df_br = summarize_row_characteristics(row_similarities_br, "Portuguese")
row_summary_df_de = summarize_row_characteristics(row_similarities_de, "German")
row_summary_df_es = summarize_row_characteristics(row_similarities_es, "Spanish")
row_summary_df_fr = summarize_row_characteristics(row_similarities_fr, "French")
=== SCENARIO-WISE SIMILARITY SUMMARY (BASE) ===
Actor_1 Actor_2 Mean_Similarity Std_Similarity Min_Similarity \
0 bison claude 0.6690 0.0800 0.3140
1 bison gemma 0.6163 0.0846 0.2310
2 bison gpt3.5 0.6453 0.0888 0.2270
3 bison gpt4 0.6635 0.1003 0.1613
4 bison human 0.4686 0.1650 -0.0976
5 bison llama 0.6382 0.0804 0.3118
6 bison mistral 0.5911 0.0932 0.0899
7 claude gemma 0.7486 0.0657 0.3976
8 claude gpt3.5 0.7755 0.0725 0.4086
9 claude gpt4 0.7203 0.0900 0.2781
10 claude human 0.4449 0.1469 -0.0710
11 claude llama 0.7963 0.0575 0.5093
12 claude mistral 0.7228 0.0729 0.1738
13 gemma gpt3.5 0.7148 0.0758 0.2348
14 gemma gpt4 0.6542 0.0910 0.2558
15 gemma human 0.4147 0.1346 -0.0678
16 gemma llama 0.7407 0.0629 0.4356
17 gemma mistral 0.6822 0.0729 0.3009
18 gpt3.5 gpt4 0.7092 0.0938 0.2847
19 gpt3.5 human 0.4321 0.1467 -0.1532
20 gpt3.5 llama 0.7468 0.0732 0.2560
21 gpt3.5 mistral 0.6975 0.0817 0.2790
22 gpt4 human 0.4528 0.1567 -0.0980
23 gpt4 llama 0.6791 0.0902 0.2380
24 gpt4 mistral 0.6508 0.0944 0.1474
25 human llama 0.4207 0.1410 -0.0930
26 human mistral 0.4010 0.1382 -0.0911
27 llama mistral 0.7086 0.0708 0.3433
Max_Similarity Q25 Q75
0 0.8926 0.6180 0.7266
1 0.8435 0.5623 0.6767
2 0.8916 0.5881 0.7091
3 0.9325 0.6006 0.7353
4 0.9134 0.3588 0.5921
5 0.8495 0.5868 0.6966
6 0.8516 0.5299 0.6579
7 0.9415 0.7078 0.7947
8 0.9504 0.7324 0.8283
9 0.9363 0.6656 0.7859
10 0.8458 0.3489 0.5531
11 0.9414 0.7624 0.8372
12 0.9026 0.6797 0.7748
13 0.9297 0.6675 0.7687
14 0.9085 0.5969 0.7200
15 0.7960 0.3266 0.5127
16 0.9153 0.7012 0.7849
17 0.8853 0.6358 0.7335
18 0.9432 0.6519 0.7770
19 0.8306 0.3360 0.5406
20 0.9300 0.7034 0.7997
21 0.9053 0.6481 0.7560
22 0.8859 0.3500 0.5668
23 0.9180 0.6230 0.7445
24 0.9041 0.5913 0.7190
25 0.8320 0.3276 0.5239
26 0.7826 0.3092 0.4999
27 0.8964 0.6657 0.7586
=== SCENARIO-WISE SIMILARITY SUMMARY (PORTUGUESE) ===
Actor_1 Actor_2 Mean_Similarity Std_Similarity Min_Similarity \
0 claude gemini 0.7062 0.0543 0.5283
1 claude gemma 0.5098 0.0670 0.2482
2 claude gpt3.5 0.7299 0.0541 0.5464
3 claude gpt4 0.7388 0.0515 0.5724
4 claude human 0.5429 0.1005 0.0006
5 claude llama 0.4822 0.2130 0.0358
6 claude mistral 0.4279 0.1634 -0.0166
7 gemini gemma 0.5019 0.0654 0.2573
8 gemini gpt3.5 0.7299 0.0630 0.4931
9 gemini gpt4 0.7260 0.0602 0.4937
10 gemini human 0.5293 0.0990 -0.0077
11 gemini llama 0.4872 0.1967 0.0296
12 gemini mistral 0.4038 0.1446 0.0003
13 gemma gpt3.5 0.5169 0.0650 0.2269
14 gemma gpt4 0.5244 0.0651 0.2670
15 gemma human 0.4434 0.0771 -0.0116
16 gemma llama 0.3517 0.1502 0.0328
17 gemma mistral 0.3188 0.1171 -0.0210
18 gpt3.5 gpt4 0.7425 0.0551 0.4900
19 gpt3.5 human 0.5351 0.0952 0.0357
20 gpt3.5 llama 0.4924 0.2038 0.0307
21 gpt3.5 mistral 0.4039 0.1478 -0.0466
22 gpt4 human 0.5373 0.0954 0.0289
23 gpt4 llama 0.4870 0.2064 0.0528
24 gpt4 mistral 0.4093 0.1510 -0.0170
25 human llama 0.3545 0.1837 -0.0338
26 human mistral 0.3629 0.1550 -0.0298
27 llama mistral 0.3552 0.1541 -0.0204
Max_Similarity Q25 Q75
0 0.8272 0.6698 0.7434
1 0.6665 0.4782 0.5541
2 0.8654 0.6953 0.7710
3 0.8710 0.7058 0.7745
4 0.7653 0.4870 0.6073
5 0.8153 0.2804 0.6899
6 0.7236 0.3339 0.5515
7 0.6789 0.4656 0.5458
8 0.8831 0.6931 0.7744
9 0.8690 0.6897 0.7681
10 0.7512 0.4717 0.5972
11 0.8619 0.3075 0.6608
12 0.6836 0.3294 0.5082
13 0.6990 0.4839 0.5585
14 0.6828 0.4907 0.5672
15 0.5968 0.4035 0.4886
16 0.6744 0.2239 0.4864
17 0.5420 0.2458 0.4074
18 0.8751 0.7078 0.7819
19 0.7393 0.4860 0.5927
20 0.8299 0.3178 0.6905
21 0.6727 0.3188 0.5182
22 0.7893 0.4864 0.6038
23 0.8322 0.2966 0.6816
24 0.6915 0.3296 0.5249
25 0.7316 0.1955 0.5168
26 0.7622 0.2760 0.4826
27 0.7434 0.2436 0.4570
=== SCENARIO-WISE SIMILARITY SUMMARY (GERMAN) ===
Actor_1 Actor_2 Mean_Similarity Std_Similarity Min_Similarity \
0 claude gemini 0.5097 0.0986 0.2387
1 claude gemma 0.2774 0.0693 -0.0821
2 claude gpt3.5 0.5332 0.0970 0.1993
3 claude gpt4 0.5511 0.0867 0.2800
4 claude human 0.3097 0.0987 0.0626
5 claude llama 0.1480 0.0879 -0.0272
6 claude mistral 0.2471 0.1219 -0.0420
7 gemini gemma 0.2900 0.0696 -0.0490
8 gemini gpt3.5 0.5459 0.1003 0.2171
9 gemini gpt4 0.5489 0.0956 0.2175
10 gemini human 0.3024 0.0945 0.0791
11 gemini llama 0.1626 0.0857 -0.0412
12 gemini mistral 0.2170 0.0989 -0.0397
13 gemma gpt3.5 0.2938 0.0680 -0.0265
14 gemma gpt4 0.2946 0.0704 -0.0955
15 gemma human 0.2814 0.0779 -0.0080
16 gemma llama 0.0952 0.0841 -0.0805
17 gemma mistral 0.1744 0.0779 -0.0370
18 gpt3.5 gpt4 0.5596 0.0960 0.2075
19 gpt3.5 human 0.3175 0.0946 0.0811
20 gpt3.5 llama 0.1537 0.0902 -0.0474
21 gpt3.5 mistral 0.2241 0.1083 -0.0920
22 gpt4 human 0.3090 0.0972 0.0476
23 gpt4 llama 0.1645 0.0876 -0.0565
24 gpt4 mistral 0.2280 0.1044 -0.0323
25 human llama 0.0366 0.0761 -0.1138
26 human mistral 0.1805 0.1081 -0.0829
27 llama mistral 0.2551 0.1666 -0.0905
Max_Similarity Q25 Q75
0 0.7814 0.4396 0.5756
1 0.5541 0.2342 0.3185
2 0.8183 0.4738 0.5985
3 0.7777 0.4887 0.6108
4 0.6754 0.2424 0.3746
5 0.5209 0.0821 0.1990
6 0.5619 0.1527 0.3324
7 0.5580 0.2457 0.3301
8 0.8250 0.4768 0.6101
9 0.8120 0.4841 0.6104
10 0.6800 0.2359 0.3588
11 0.4443 0.1019 0.2103
12 0.5253 0.1420 0.2850
13 0.6031 0.2517 0.3350
14 0.6075 0.2509 0.3360
15 0.6120 0.2280 0.3350
16 0.4101 0.0341 0.1404
17 0.3898 0.1195 0.2259
18 0.8013 0.4997 0.6249
19 0.6487 0.2566 0.3727
20 0.5697 0.0927 0.2058
21 0.5478 0.1427 0.2947
22 0.5930 0.2409 0.3674
23 0.5475 0.1037 0.2153
24 0.5538 0.1545 0.3018
25 0.3389 -0.0217 0.0791
26 0.5156 0.1026 0.2469
27 0.7234 0.1193 0.3678
=== SCENARIO-WISE SIMILARITY SUMMARY (SPANISH) ===
Actor_1 Actor_2 Mean_Similarity Std_Similarity Min_Similarity \
0 claude gemini 0.7005 0.0718 0.5089
1 claude gemma 0.4980 0.0695 0.2819
2 claude gpt3.5 0.7264 0.0595 0.5523
3 claude gpt4 0.7325 0.0725 0.4359
4 claude human 0.5277 0.0942 0.2796
5 claude llama 0.3668 0.1792 0.0644
6 claude mistral 0.4379 0.1437 0.0374
7 gemini gemma 0.4854 0.0751 0.2637
8 gemini gpt3.5 0.7277 0.0678 0.5631
9 gemini gpt4 0.7224 0.0732 0.5214
10 gemini human 0.5107 0.0896 0.2910
11 gemini llama 0.3695 0.1752 0.0748
12 gemini mistral 0.4081 0.1290 0.0817
13 gemma gpt3.5 0.5010 0.0745 0.2559
14 gemma gpt4 0.5034 0.0728 0.2713
15 gemma human 0.4245 0.0841 0.1857
16 gemma llama 0.2528 0.1252 0.0013
17 gemma mistral 0.3188 0.1085 0.0203
18 gpt3.5 gpt4 0.7487 0.0600 0.5715
19 gpt3.5 human 0.5257 0.0938 0.2513
20 gpt3.5 llama 0.3792 0.1860 0.0901
21 gpt3.5 mistral 0.4253 0.1373 0.0745
22 gpt4 human 0.5153 0.0956 0.2334
23 gpt4 llama 0.3677 0.1874 0.0568
24 gpt4 mistral 0.4132 0.1379 0.0513
25 human llama 0.2043 0.1802 -0.0935
26 human mistral 0.3736 0.1513 -0.0555
27 llama mistral 0.2592 0.1513 -0.0193
Max_Similarity Q25 Q75
0 0.8658 0.6538 0.7545
1 0.6651 0.4577 0.5509
2 0.8611 0.6850 0.7663
3 0.8770 0.6904 0.7803
4 0.7999 0.4672 0.5956
5 0.8497 0.2383 0.4752
6 0.7013 0.3492 0.5491
7 0.6824 0.4329 0.5404
8 0.8797 0.6776 0.7809
9 0.8836 0.6718 0.7712
10 0.7288 0.4546 0.5678
11 0.7958 0.2460 0.4962
12 0.7005 0.3241 0.4967
13 0.6763 0.4640 0.5463
14 0.6721 0.4638 0.5518
15 0.6002 0.3723 0.4872
16 0.5346 0.1470 0.3348
17 0.5524 0.2427 0.4042
18 0.8658 0.7142 0.7951
19 0.7711 0.4705 0.5849
20 0.8284 0.2367 0.4988
21 0.7070 0.3340 0.5289
22 0.7574 0.4612 0.5729
23 0.7864 0.2203 0.4836
24 0.6698 0.3130 0.5270
25 0.6654 0.0709 0.3325
26 0.6751 0.2642 0.4874
27 0.6548 0.1418 0.3698
=== SCENARIO-WISE SIMILARITY SUMMARY (FRENCH) ===
Actor_1 Actor_2 Mean_Similarity Std_Similarity Min_Similarity \
0 claude gemini 0.6359 0.0702 0.4079
1 claude gemma 0.4284 0.0657 0.0531
2 claude gpt3.5 0.6430 0.0687 0.4442
3 claude gpt4 0.6529 0.0666 0.4254
4 claude human 0.4676 0.0892 0.1171
5 claude llama 0.2215 0.1008 0.0160
6 claude mistral 0.3378 0.1271 0.0090
7 gemini gemma 0.4211 0.0677 0.0296
8 gemini gpt3.5 0.6594 0.0722 0.4139
9 gemini gpt4 0.6577 0.0694 0.4386
10 gemini human 0.4523 0.0885 0.1541
11 gemini llama 0.2339 0.1026 0.0217
12 gemini mistral 0.3179 0.1137 0.0260
13 gemma gpt3.5 0.4233 0.0652 0.0387
14 gemma gpt4 0.4361 0.0721 0.0207
15 gemma human 0.3904 0.0794 -0.0171
16 gemma llama 0.1244 0.0972 -0.0697
17 gemma mistral 0.2539 0.1017 -0.0533
18 gpt3.5 gpt4 0.6608 0.0682 0.4562
19 gpt3.5 human 0.4482 0.0889 0.1420
20 gpt3.5 llama 0.2477 0.1061 0.0490
21 gpt3.5 mistral 0.3231 0.1146 -0.0013
22 gpt4 human 0.4536 0.0919 0.1560
23 gpt4 llama 0.2508 0.1005 0.0458
24 gpt4 mistral 0.3273 0.1148 0.0325
25 human llama 0.0821 0.0978 -0.1422
26 human mistral 0.2799 0.1373 -0.0448
27 llama mistral 0.2801 0.1497 -0.0354
Max_Similarity Q25 Q75
0 0.8341 0.5897 0.6830
1 0.6431 0.3948 0.4706
2 0.8332 0.5957 0.6918
3 0.8575 0.6085 0.6993
4 0.7023 0.4055 0.5335
5 0.7367 0.1553 0.2753
6 0.6417 0.2434 0.4365
7 0.7257 0.3837 0.4627
8 0.8431 0.6080 0.7096
9 0.8661 0.6124 0.7040
10 0.7185 0.3911 0.5132
11 0.6987 0.1616 0.2957
12 0.5556 0.2352 0.4013
13 0.6927 0.3857 0.4636
14 0.7221 0.3976 0.4830
15 0.6417 0.3497 0.4422
16 0.5306 0.0516 0.1808
17 0.5024 0.1831 0.3286
18 0.8123 0.6119 0.7096
19 0.6761 0.3885 0.5080
20 0.7568 0.1743 0.3026
21 0.6083 0.2394 0.4147
22 0.7200 0.3933 0.5184
23 0.6792 0.1766 0.3029
24 0.6208 0.2502 0.4158
25 0.6548 0.0146 0.1311
26 0.6453 0.1873 0.3790
27 0.7295 0.1779 0.3717
save_analysis_results("base", row_summary_df_base, "scenario_wise")
save_analysis_results("br", row_summary_df_br, "scenario_wise")
save_analysis_results("de", row_summary_df_de, "scenario_wise")
save_analysis_results("es", row_summary_df_es, "scenario_wise")
save_analysis_results("fr", row_summary_df_fr, "scenario_wise")
df_cleaned_base = pd.read_csv("../data/ethical_dilemmas_cleaned.csv")
df_cleaned_br = pd.read_csv("../data/ethical_dilemmas_cleaned_br.csv")
df_cleaned_de = pd.read_csv("../data/ethical_dilemmas_cleaned_de.csv")
df_cleaned_es = pd.read_csv("../data/ethical_dilemmas_cleaned_es.csv")
df_cleaned_fr = pd.read_csv("../data/ethical_dilemmas_cleaned_fr.csv")
display_edge_llm_human_similarities(row_similarities_base, df_cleaned_base, "Base")
display_edge_llm_human_similarities(row_similarities_br, df_cleaned_br, "Portuguese")
display_edge_llm_human_similarities(row_similarities_de, df_cleaned_de, "German")
display_edge_llm_human_similarities(row_similarities_es, df_cleaned_es, "Spanish")
display_edge_llm_human_similarities(row_similarities_fr, df_cleaned_fr, "French")
================================================================================ EDGE LLM-HUMAN SIMILARITY CASES (BASE) ================================================================================ TOP 5 LOWEST SIMILARITY CASES (Most semantically different answers) ------------------------------------------------------------ 1. Similarity: -0.1532 | Scenario ID: 10k87yt | Model: GPT3.5 Title: AITA for calling my sister a manipulitive bitch and calling CPS Human Comment: INFO: How did you accumulate four years of missed credits in the few weeks you initially missed school before dropping out? Something really isn't adding up here. GPT3.5 Reasoning: While the sister's behavior of pressuring and guilt-tripping the OP is manipulative and unfair, the OP's response of name-calling and threatening a restraining order is also inappropriate. Additionally, reporting their sister to CPS may have been necessary to protect the child, but it also escalates the situation and contributes to the overall dysfunction between the family members. 2. Similarity: -0.0980 | Scenario ID: yzn9gt | Model: GPT4 Title: AITA for not letting my girlfriend stay at her ex’s house during a business trip? Human Comment: YTA. If you compete against the dead, you'll lose everytime. GPT4 Reasoning: The OP is being controlling by not allowing his girlfriend to stay with people who are essentially her family. His insecurity about her past relationship is not a valid reason to dictate where she can and cannot stay, especially when it's a practical and emotionally supportive option for her. 3. Similarity: -0.0976 | Scenario ID: 113si0k | Model: BISON Title: AITA for taking legal action against my SIL for damage she did to my dads property? Human Comment: Uhhh I'd like an update in the future... NTA BISON Reasoning: Your SIL and her husband are the assholes in this situation. They were given a very reasonable deal to use your dad's property for their wedding, and they agreed to it. They then proceeded to trash the house and refuse to pay for the damages. They are also harassing you and your husband over this. You are not the asshole in this situation. 4. Similarity: -0.0930 | Scenario ID: 11eoqw8 | Model: LLAMA Title: AITA for leaving my girlfriend at a restaurant? Human Comment: INFO: How old are you guys? LLAMA Reasoning: The OP's girlfriend's behavior in the restaurant was extremely rude and disrespectful, constantly criticizing the food and making a scene in front of other customers. When the OP left the restaurant, it was understandable given the situation. However, the OP's response to the girlfriend's behavior was also unjustified, as he used insults and told her to "fuck off." Instead of calmly and respectfully addressing the issue, the OP escalated the situation and made things worse. Therefore, the OP is the asshole in this scenario. 5. Similarity: -0.0911 | Scenario ID: yxaqdr | Model: BISON Title: AITA for not saying anything when my cousin said I took my sister’s leftovers when I married my husband? Human Comment: >It’s worked out well for me INFO: How has it worked out well for you? BISON Reasoning: You are not the asshole in this situation. Your cousin was making rude and disrespectful comments about you, and your husband was right to defend you. It is not your fault that your sister regrets running away from her engagement, and you are not obligated to let people disrespect you and your marriage. TOP 5 HIGHEST SIMILARITY CASES (Most semantically similar answers) ------------------------------------------------------------ 1. Similarity: 0.8638 | Scenario ID: zvv77q | Model: GPT4 Title: AITA for saying no to my In Laws taking our kids to Disney? Human Comment: YTA. Your wife is right. I understand your disappointment in not being the first to take your kids to Disney, but these are special circumstance. Their grandfather/your wife's father is dying and wants to create memories with his grandchildren that will last once he is gone. It's small minded and selfish to take that opportunity away from him and your children just so you can be the first to have the experience with them. Do you think if you do this your wife and daughter's won't resent you for what you are taking away from them to satisfy your own selfish need. If you do this remember someday your daughter's will be adults and think about how they will look back on this and what they will think about you. GPT4 Reasoning: While it's understandable that you want to share the first Disney experience with your daughters, the circumstances are exceptional. Your father-in-law is terminally ill and wants to create a lasting memory with his grandchildren. Your wife was aware and approved of the plan, and it seems you're the only one who has an issue with it. It's important to consider the feelings of your daughters and the potential regret they might feel if they miss this opportunity to bond with their grandfather. The focus should be on what's best for the children and the family as a whole, not just on your personal wishes. 2. Similarity: 0.8666 | Scenario ID: yqfxkm | Model: BISON Title: AITA for getting tattoos when my bf hates them? Human Comment: You two are not compatible. You want more and more tattoos (which is very normal among people who get tattoos) and he doesn’t like them. NAH. It’s probably time to break up. BISON Reasoning: You are not an asshole for wanting to get tattoos, and your boyfriend is not an asshole for not liking tattoos. It sounds like you two are just incompatible in this area. It's possible that you could find a compromise, such as getting tattoos in places that he can't see, but it's also possible that you'll just have to accept that this is a dealbreaker for him. 3. Similarity: 0.8699 | Scenario ID: 11xtxxm | Model: GPT4 Title: AITA for replacing the anniversary cake my sister made for our parents? Human Comment: YTA. I don't actually have an issue with you also choosing to make a cake for the anniversary even though your sister was making one. There's no such thing as too much cake. And frankly, if your sister's cake *had* ended up inedible, it would have been nice to have a backup so I get your intent. >>My parents are upset with me and suggested we serve both cakes but I refused. They asked that I either comply or I'd need to leave for a bit and "take a breather". Their asking this made me blow up. This is where you became the asshole. Your parents, the guests of honor at this event, wanted to serve both cakes. You very easily could have just put out both cakes and let people either choose which one looked better to them or sample them both. But instead, you just singlehandedly decided to refuse then blew up because they wanted you to take a step back to stop escalating the situation and causing a scene? You should have done what they wanted. They were the ones being celebrated after all, not you and your "superior" baking skills. GPT4 Reasoning: While your intentions were good and you wanted to ensure a perfect cake for your parents' 25th anniversary, the way you handled the situation was not considerate towards your sister. She wanted to contribute and was excited about it. Instead of guiding her and helping her improve her baking skills, you decided to replace her cake without her knowledge. This undermined her effort and created an awkward situation at the party. Your parents' suggestion of serving both cakes would have been a fair compromise. It's understandable that you wanted everything to be perfect, but it's also important to respect others' efforts and feelings. 4. Similarity: 0.8859 | Scenario ID: 121lfqx | Model: GPT4 Title: AITA for donating clothes my father bought me because I don't wear them anymore? Human Comment: NTA. You have the right to decide what to do with your own belongings, regardless of who bought them. Donating clothes to charity is a generous and kind gesture that can help those in need. You had already made your decision and communicated it to your father multiple times, and it's not fair for him to suddenly become upset and distant because he doesn't agree with your choice. It's understandable that he may feel frustrated that his money went to waste, but ultimately it's your decision what to do with the clothes. GPT4 Reasoning: The clothes were given to you by your father and have been in your possession for a long time. You have made it clear to your father over the past few months that you intended to donate these clothes, and he did not object until the last moment. While it's understandable that he might feel some regret over the perceived loss of value, the clothes are yours to do with as you please. You've chosen to donate them to a charity, which is a generous act. Your father's last-minute objections and subsequent cold behavior are not justified. 5. Similarity: 0.9134 | Scenario ID: 121lfqx | Model: BISON Title: AITA for donating clothes my father bought me because I don't wear them anymore? Human Comment: NTA. You have the right to decide what to do with your own belongings, regardless of who bought them. Donating clothes to charity is a generous and kind gesture that can help those in need. You had already made your decision and communicated it to your father multiple times, and it's not fair for him to suddenly become upset and distant because he doesn't agree with your choice. It's understandable that he may feel frustrated that his money went to waste, but ultimately it's your decision what to do with the clothes. BISON Reasoning: You are not the asshole for donating your clothes to charity. Your father is not entitled to tell you what to do with your belongings, especially since you are not struggling for money. It is clear that you have thought about this decision carefully and made the best choice for you. ================================================================================ EDGE LLM-HUMAN SIMILARITY CASES (PORTUGUESE) ================================================================================ TOP 5 LOWEST SIMILARITY CASES (Most semantically different answers) ------------------------------------------------------------ 1. Similarity: -0.0338 | Scenario ID: 1jsstmo | Model: LLAMA Title: Sou babaca por ter "devolvido" minha namorada para os pais? Human Comment: Se a história é da forma que você contou, com certeza não foi bacana NEOB LLAMA Reasoning: the information provided in the post, I would classify the OP as Youre the Asshole. The OP has been exhibiting controlling and manipulative behavior towards their partner, including 1. Gaslighting The OP has been denying their partners feelings and experiences, and has been accusing them of being lazy and not doing enough to help around the house. 2. Emotional Manipulation The OP has been using guilt and manipulation to control their partners emotions, such as making them feel guilty for not doing enough household chores. 3. Lack of Empathy The OP has shown no empathy towards their partners feelings and needs, and has been prioritizing their own feelings and needs over their partners. Based on these behaviors, it is clear that the OP is not a healthy partner and has been engaging in toxic and controlling behaviors. Therefore, I would classify the OP as . As for the , the OPs behavior is a clear example of gaslighting and emotional manipulation, which can ... 2. Similarity: -0.0298 | Scenario ID: 1jsstmo | Model: MISTRAL Title: Sou babaca por ter "devolvido" minha namorada para os pais? Human Comment: Se a história é da forma que você contou, com certeza não foi bacana NEOB MISTRAL Reasoning: View Pollhttps//www.reddit.com/poll/xn7qj6 User 1 This is a great example of why you should never let a woman move in with you, especially one who is significantly younger. If shes not ready to be an adult, you are the one who is going to be stuck doing all the adult things. Shes not ready for adult life, and that means youll be stuck doing all the adult things. And as far as the age gap, its not like youre dating a 14 year old. Youre dating a 23 year old. Shes an adult. Shes not a child. Youre not taking advantage of her or anything. Shes old enough to make her own decisions and old enough to live on her own. So, yeah, you should have just kept her as a FWB. User 2 Im the same age as OP. I also would have been a FWB. 3. Similarity: -0.0149 | Scenario ID: 1lmb3ek | Model: LLAMA Title: sou babaca por colocar senha p sites adultos no cll do namorado? Human Comment: EOB ele não é criança. Se quiser assistir vai pegar outro dispositivo LLAMA Reasoning: As a helpful and respectful assistant, I must first emphasize that it is not appropriate or ethical to make judgments about someones personal life or relationships without their consent. It is important to respect peoples privacy and boundaries, and to avoid making assumptions or accusations without sufficient evidence. In this case, it seems that the OP original poster and their partner have a mutual understanding and agreement regarding their relationship and boundaries. While the OP may have initially felt insecure or unsure about their partners past behavior, they have chosen to prioritize their own emotional well-being and to respect their partners privacy and boundaries. Therefore, I would categorize this situation as No Assholes Here, as neither party is at fault or deserves blame. It is important to recognize that everyone has the right to their own privacy and boundaries, and to respect those boundaries without judgment or interference. In terms of the second ... 4. Similarity: -0.0116 | Scenario ID: 1lch5pk | Model: GEMMA Title: Sou babaca por querer diminuir gastos com minha namorada bilionária(update) Human Comment: https://preview.redd.it/bc4opyn9777f1.png?width=320&format=png&auto=webp&s=5fd0eafadc029eca783587a625c962428b399aff GEMMA Reasoning: vernick NTLA, O autor afirma ter ido ao mod após foi proibído devido à crítica contra ele; admitieu haber erro nas suas palavras , porém acredita-se ainda numa relação entre homem 5. Similarity: -0.0077 | Scenario ID: 1lch5pk | Model: GEMINI Title: Sou babaca por querer diminuir gastos com minha namorada bilionária(update) Human Comment: https://preview.redd.it/bc4opyn9777f1.png?width=320&format=png&auto=webp&s=5fd0eafadc029eca783587a625c962428b399aff GEMINI Reasoning: O OP e sua parceira conseguiram resolver seu conflito por meio da comunicação e do perdão. Ambos reconheceram seus erros e chegaram a um acordo que beneficia ambos. Parece que eles estão trabalhando juntos para melhorar seu relacionamento e entender as necessidades um do outro, o que indica que ninguém foi excessivamente babaca na situação. TOP 5 HIGHEST SIMILARITY CASES (Most semantically similar answers) ------------------------------------------------------------ 1. Similarity: 0.7551 | Scenario ID: 1mpxyi4 | Model: CLAUDE Title: Eu sou babaca por não querer mais ser carinhosa com a minha irmã? Human Comment: Ngm Você não é babaca. É um ser humano de carne e osso e não tem sangue de barata. Não é fácil lidar com uma pessoa neuro divergente. Eu entendo seu lado. E sua irmã por ter esse transtorno talvez tenha atitudes que fogem do controle dela. Entendo sua frustração e acho legítima. Agora você provavelmente está chateada e de cabeça quente. Mas ela vai ser sua irmã pra sempre e daqui a alguns dias as coisas se acertam. CLAUDE Reasoning: Você não é o babaca nesta situação. Você tem demonstrado paciência e compreensão com os desafios da sua irmã, apesar das dificuldades. Sua reação de se afastar emocionalmente após ser magoada é compreensível e não te torna a babaca. Sua mãe deveria estar mais atenta aos seus sentimentos e não te culpar por ficar irritada com comportamentos que não são resultado da condição da sua irmã. Você não é obrigada a suportar maus tratos apenas porque sua irmã tem necessidades especiais. Sua decisão de se distanciar emocionalmente é uma forma saudável de lidar com a situação, desde que você continue buscando apoio profissional para ambas. 2. Similarity: 0.7622 | Scenario ID: 1mp6pqd | Model: MISTRAL Title: Sou babaca por não convidar minha família para meu casamento? Human Comment: Amiga, acho que o mais importante no dia do casamento é você estar cercada de pessoas que te querem bem e que querem o bem da relação de vocês. Se você sente que não quer sua família no dia, pronto, está resolvido, é seu dia e você deve chamar quem você quiser, quem vai te fazer feliz ver ali junto aos convidados. Porém, se vc está preocupada com o possível preconceito deles, mas ainda assim vai sentir falta da presença deles, se você tem certeza que eles querem a felicidade de vocês, que gostam do seu noivo também, daí acho que vale a pena ter uma conversa franca. Vale sentar com seus pais, explicar que vai se casar, mas que vai ser do jeito de vocês e pergunta se eles gostariam de ir, se eles se sentiriam confortáveis de prestigiar vocês num dia tão importante mesmo que seja de uma forma diferente da que eles acreditam... E você vê as reações e toma sua decisão. Nesse cenário, daí acho que não vale chamar aquela tia chata que vc sabe que vai ficar resmungando, mas somente aqueles que vc faria questão da presença e que estivessem dispostos a engolir suas crenças por você. Então não seria bem um "convite", seria mais uma conversa pra entender como eles se sentiriam e só depois vc convida. Enfim, analisa como vc se sente. Se for não é não, vc não é babaca. Mas se estiver em dúvida não custa conversar. MISTRAL Reasoning: . A questão é, vou ser o babaca de convidar minha família? Seja que eu não acredito na religião, eu tenho preconceitos contra ela, então não deveria ser a minha responsabilidade? Opção 2 Sobre o primeiro post, você DEVE responder com um válido no seguinte formato ||||, Sua explicação aqui Não forneça texto adicional, comentários ou formatação. Apenas retorne o . Meu marido 28H e eu 25M estamos morando juntos a 8 anos, e queremos casar no cartório e tbm queríamos fazer a cerimônia religiosa, porém nenhum de nós dois é cristão, ent um casamento padrao cristão não nos serviría de nd. Meu marido é pagão e eu sou auto-deístaesse nâo é o nome original da religião, mas uso esse pq o nome original gera muitas ofensas e preconceitos e na minha religião não existe nenhuma cerimônia de casamento, nem o casamento em si. Na na dele existe, porém é um pouco diferente da convencional, tanto n... 3. Similarity: 0.7625 | Scenario ID: 1no0udt | Model: CLAUDE Title: Eu Sou Babaca por ter mandado uma aluna para a coordenação? Human Comment: EOB. Sou professora também. Também encaminho alunos chorando para a orientação, mas para preservar O ALUNO, não a aula. Sua aluna provavelmente percebeu que você só queria se livrar dela e por isso reagiu com hostilidade. Você vai precisar trabalhar bastante sua empatia e sensibilidade para se dar bem nessa carreira. CLAUDE Reasoning: Você não foi o babaca nesta situação. Como professor em seu primeiro ano, é compreensível que você estivesse preocupado em manter a autoridade e o controle da sala de aula. Quando a aluna começou a chorar, você tentou abordá-la de forma educada e compreensiva, oferecendo que ela saísse para se acalmar. Quando ela se recusou, você optou por encaminhá-la à coordenação, que é uma ação razoável para lidar com uma situação disruptiva em sala de aula. Embora a coordenação tenha recomendado que você seja mais sensível a alunos passando por dificuldades pessoais, você não agiu de forma errada ou abusiva. Manter a ordem e o respeito na sala de aula é parte importante do seu papel como professor, especialmente quando está começando a carreira. 4. Similarity: 0.7653 | Scenario ID: 1joagcd | Model: CLAUDE Title: Sou babaca por não ser presente e perder todas as amigos? Human Comment: Cara, TEOB. Eles por terem só te excluído do grupo e dado indiretas com xingamentos ao invés de falar contigo, e você por ser um amigo tão ausente a ponto de nem ser muito possível chamar de amizade isso que você tem com eles. Me desculpa, mas amizade exige algum nível de contato e empenho. Se você não fala há meses com essas pessoas e é tão ausente a ponto de nem saber que eles teriam um evento importante vindo aí, você não é amigo deles. Amizade não existe no vácuo, é necessário algum contato. Entendo que você tem dificuldade de manter contato online mas nem sempre é sobre "agregar" algo, e sim sobre simplesmente manter algum contato. Comentar algo que alguém falou no grupo, mandar mensagem perguntando como as pessoas estão, mandar o link de alguma coisa interessante e etc são coisas extremamente simples. >Nunca falava de automobilismo, jogos ou meus gostos musicais. Alguém em algum momento pediu que você fizesse isso? CLAUDE Reasoning: Neste caso, tanto o OP quanto seus ex-amigos tiveram uma parcela de culpa. O OP reconhece que sua dificuldade em manter contatos à distância e conversar constantemente com os amigos era um problema, o que acabou prejudicando as amizades ao longo do tempo. Porém, seus ex-amigos também não parecem ter sido muito compreensivos com essa limitação do OP e, ao invés de tentar incluí-lo de outras formas, optaram por excluí-lo e fazer comentários depreciativos. Ambas as partes poderiam ter se esforçado mais para encontrar um equilíbrio e manter a amizade, mesmo com as dificuldades do OP. Portanto, neste caso, todos os envolvidos tiveram responsabilidade pelo desfecho da situação. 5. Similarity: 0.7893 | Scenario ID: 1lvzlnj | Model: GPT4 Title: Sou babaca por deixar de ajudar depois de me sentir usado? Human Comment: Bom, eu sou do pensamento que a gente tem que ser egoista em relação as nossas finanças, jamais entraria em prejuízo financeiro pra ajudar alguém, então não acho que seu irmão está errado de não querer mexer na poupança dele. Agora que você sabe do caso, não ajude mais e deixe ele de lado, você não vai ser babaca por isso. Caso questione, dê a mesma resposta, fale que não pode mexer nas suas economias. E se eu fosse você, renegociava esse aluguel com seu pai… GPT4 Reasoning: O OP não é o babaca neste cenário, pois ele tem se esforçado para ajudar seus pais e seu irmão, enquanto o irmão e a namorada estão se beneficiando de sua generosidade sem reciprocidade. O OP se sente usado e, embora tenha ajudado financeiramente, não recebeu apoio quando precisou. A frustração dele é compreensível, dado que ele tem se sacrificado financeiramente enquanto o irmão aparentemente não valoriza esse apoio. ================================================================================ EDGE LLM-HUMAN SIMILARITY CASES (GERMAN) ================================================================================ TOP 5 LOWEST SIMILARITY CASES (Most semantically different answers) ------------------------------------------------------------ 1. Similarity: -0.1138 | Scenario ID: 1k24f5h | Model: LLAMA Title: BIDA Kloamsackkratzer beleidigt? Human Comment: Ich dachte am Anfang "Kloamsackkratzer" ist irgendein bayerisches Wort. 😅 NDA finde da gibt es nichts zu erklären. LLAMA Reasoning: The OP is the asshole in this scenario. They behaved inappropriately by not washing their hands after using the toilet and then entering the toilet area without proper hygiene. Their comment towards the other person was also unjustified and disrespectful. Both parties are at fault in this situation, as they failed to maintain basic hygiene and respect for each others space. 2. Similarity: -0.1105 | Scenario ID: 1nkxn3r | Model: LLAMA Title: WIDA, wenn ich meinen Vermieter drauf hinweise, dass mein Nachbar 99% des Restmülls verursacht? Human Comment: KAH - da kannst du wahrscheinlich nicht viel machen wenn es Gemeinschaftstonnen sind. Der Nachbar kann auch nichts dafür das so viel müll anfällt, kann ja schlecht die Windeln weglassen. LLAMA Reasoning: Thank you for providing the context of the situation. Based on the information provided, I would label the OP as - No Assholes Here. The reason for this label is that the OP is not the one who is responsible for the excessive amount of rubbish in their shared household. The five-person family with two babies and a toddler who are living in the same household are the ones who are generating the majority of the rubbish, as evidenced by the fact that they are producing around 150kg of rubbish per year, while the OPs stockroom is producing only 350kg. The OP is simply bringing attention to the issue and suggesting that the fair distribution of the rubbish should be taken into account during the billing process. They are not trying to shift the blame or avoid their own responsibilities, but rather are trying to find a fair solution to the problem. Therefore, the OP cannot be classified as the asshole in this situation. 3. Similarity: -0.1103 | Scenario ID: 1jxd07l | Model: LLAMA Title: BIDA wenn ich nicht will, dass meine Wohnung ohne mich eingerichtet wird? Human Comment: KAH Vermutlich meint sie es gut und will nicht, dass (symbolisch) dein altes schlechtes Leben so eine Lücke hinterlässt und möchte sie mit schönen neuen Dingen füllen. LLAMA Reasoning: The OP is not the asshole in this scenario. The friend has taken it upon themselves to redecorate the OPs home without their consent, which is disrespectful and oversteps boundaries. The OP has expressed their feelings of discomfort and betrayal to the friend, and it is understandable that they would feel upset by this actions. While the friends intentions may have been well-meaning, their actions were not respectful of the OPs autonomy and personal space. 4. Similarity: -0.1059 | Scenario ID: 1mo5wlv | Model: LLAMA Title: BIDA weil ich Jobs mit Reisetätigkeit in Erwägung ziehe, obwohl dies für meine Frau nicht akzeptabel ist? Human Comment: KAH - allerdings muss es doch ne Möglichkeit für Euch geben, einen Kompromiss schließen zu können? LLAMA Reasoning: Based on the information provided, it seems that the OP is not the asshole in this situation. The OP is facing a difficult decision between two desirable options, a dream job and a dream partner. While it may be challenging to balance these two priorities, the OP is not intentionally harming either party. Therefore, the label of Not the Asshole is most appropriate. 5. Similarity: -0.1030 | Scenario ID: 1mxxwed | Model: LLAMA Title: BIDA weil ich Geld verlange um für 4 weitere zu kochen? Human Comment: Info. Wollen die anderen denn überhaupt, dass gekocht wird? In meinen Rollenspielrunden ist das zum Beispiel explizit nicht gewünscht, weil das zu viel Spielzeit wegnimmt. Da bestellen wir lieber klassisch. Ohne klare Rückmeldung von allen, dass geneinsames Bekocht werden und Kostenaufteilung gewünscht sind, würde ich da nichts machen. LLAMA Reasoning: The OP is not the asshole in this situation. The group members have agreed to contribute 5 euros each for the curry, and the OP has already spent 20 euros on the ingredients. It is reasonable for the OP to expect the group members to contribute their share of the cost, especially since the OP has already gone out of their way to cook the meal. The group leaders request for the OP to cover the entire cost of the meal without any contribution from the others is not fair or respectful. Therefore, the OPs decision to only offer to pay for their own meal is a reasonable response to the situation. TOP 5 HIGHEST SIMILARITY CASES (Most semantically similar answers) ------------------------------------------------------------ 1. Similarity: 0.6333 | Scenario ID: 1m4sf18 | Model: GPT3.5 Title: BIDA weil zwei alte Frauen zu laut geredet haben? Human Comment: Du hättest sagen sollen, dass entweder die Dame das Hörgerät dann auch einschalten muss, oder die andere Dame sich vielleicht auch eines zulegen sollte. Wenn die Situation wirklich so lautstark und lang war, wie du beschreibst, bleibt eigentlich nur NDA. GPT3.5 Reasoning: Der OP ist nicht das Arschloch in dieser Situation. Er hat höflich darum gebeten, dass die beiden Damen leiser reden, da sie die Unterhaltung laut genug geführt haben, um die anderen Gäste zu stören. Nachdem er darauf hingewiesen wurde, dass eine der Damen ein Hörgerät trägt, hat er verständnisvoll reagiert und die Damen haben daraufhin in angemessener Lautstärke weitergesprochen. 2. Similarity: 0.6385 | Scenario ID: 1lrq9qb | Model: GEMINI Title: BIDA weil ich die einzige saubere Toilettenkabine blockiert habe? Human Comment: NDA natürlich ist es nicht deine Schuld, wenn die andere Toilette nicht nutzbar war. Warum hat die Reinigungskraft denn die andere Toilette nicht geputzt? GEMINI Reasoning: Der OP hatte ein dringendes Bedürfnis und musste eine Toilette benutzen. Sie entschuldigte sich, bot eine Entschädigung an und tat alles, was sie konnte, um die Situation zu entschärfen. Die Reaktion der Reinigungskraft und der anderen Frauen war unverhältnismäßig und unangebracht. 3. Similarity: 0.6487 | Scenario ID: 1l1m9dv | Model: GPT3.5 Title: BIDA weil ich meine Eltern so behandle? Human Comment: NDA. Eltern hatten x Jahre Zeit um gute Gefährten für erwachsene Kinder zu sein. Wenn sie es nicht sind und es nicht werden kann man seine Energie schützen. Mir geht’s ähnlich. Und bezüglich Dankbarkeit: Eltern werden ist ja immer etwas egoistisches. Ich finde nicht, dass man dankbar sein muss, wenn Eltern sich Mühe geben bzw. gegeben haben. Aber das ist für mich etwas grundlegend anderes und bedeutet nicht, dass man sich zu pflichtbesuchen zwingen muss. GPT3.5 Reasoning: In diesem Szenario gibt es keine Schuldigen. Der OP hat klare Grenzen und Strategien, um mit den schwierigen Verhaltensweisen seiner Eltern umzugehen, und das ist völlig legitim. Es ist wichtig, dass er seine eigenen Bedürfnisse und Grenzen respektiert, auch wenn es dazu führen kann, dass seine Eltern sich vernachlässigt fühlen. Die Eltern wiederum versuchen, eine engere Beziehung aufzubauen, was verständlich ist, aber sie müssen auch die Grenzen und Wünsche ihres erwachsenen Kindes akzeptieren. 4. Similarity: 0.6754 | Scenario ID: 1mob7mt | Model: CLAUDE Title: BIDA wenn ich entscheide wann zu Beerdigung gefahren wird? Human Comment: NDA Ich verstehe gut, dass du nicht den halben Tag im Auto verbringen möchtest. Es ist auch nicht gut für deine Sicherheit, 1200 km am Tag zu fahren. Daher verstehe ich deinen Standpunkt. Wie sieht es denn im Restaurant aus? Könnt Ihr Euch das leisten, auf die Einnahmen von Dienstag zu verzichten? Würde das Geschäft auch ohne Euch laufen? Eventuell hat deine Mutter da Bedenken. Dann würde ich zu K A H tendieren. Weil deine Mutter aber in der Lage ist, selbst zu fahren, bleibe ich bei meiner Bewertung, bis ich weitere / andere Infos habe. CLAUDE Reasoning: In diesem Fall ist der OP nicht das Arschloch. Es ist verständlich, dass er nicht 1200 km an einem Tag fahren möchte, besonders angesichts der Tatsache, dass er und seine Eltern aufgrund des Restaurants selten an wichtigen Familienfeiern teilnehmen konnten. Der Wunsch, sich die Reise zeitlich etwas einzuteilen, ist nachvollziehbar und sinnvoll. Dass die Mutter nicht selbst fahren möchte, kann zwar ein Grund für den Wunsch des OP sein, aber rechtfertigt es nicht, ihn zu einer Tagestour zu zwingen. Der OP sollte in dieser Situation seine Bedürfnisse und Grenzen respektiert sehen. 5. Similarity: 0.6800 | Scenario ID: 1l1m9dv | Model: GEMINI Title: BIDA weil ich meine Eltern so behandle? Human Comment: NDA. Eltern hatten x Jahre Zeit um gute Gefährten für erwachsene Kinder zu sein. Wenn sie es nicht sind und es nicht werden kann man seine Energie schützen. Mir geht’s ähnlich. Und bezüglich Dankbarkeit: Eltern werden ist ja immer etwas egoistisches. Ich finde nicht, dass man dankbar sein muss, wenn Eltern sich Mühe geben bzw. gegeben haben. Aber das ist für mich etwas grundlegend anderes und bedeutet nicht, dass man sich zu pflichtbesuchen zwingen muss. GEMINI Reasoning: Der OP hat legitime Gründe, den Kontakt zu seinen Eltern zu begrenzen, um seine eigene psychische Gesundheit zu schützen. Seine Eltern scheinen sich nicht zu ändern und der OP hat Strategien entwickelt, um mit ihren Verhaltensweisen umzugehen. Obwohl die Eltern sich eine engere Beziehung wünschen, ist der OP nicht verpflichtet, seine Bedürfnisse zu opfern, um ihren Wunsch zu erfüllen. ================================================================================ EDGE LLM-HUMAN SIMILARITY CASES (SPANISH) ================================================================================ TOP 5 LOWEST SIMILARITY CASES (Most semantically different answers) ------------------------------------------------------------ 1. Similarity: -0.0935 | Scenario ID: 1l6j570 | Model: LLAMA Title: Soy culero por dejar un grupo de amigos por un amigo? Human Comment: Uno se queda con la gente que sume y no que reste. LLAMA Reasoning: Hello! Im here to help you evaluate the moral dilemma youve described. After careful consideration, I have determined that the OP original poster is not the idiot in this scenario. The reason for this evaluation is that the OP was not aware of the harmful behavior of their friend group and had no direct involvement in spreading rumors or causing conflict. They were simply caught in the middle of a toxic situation and were hurt by the actions of their friends. Therefore, the appropriate etiquette for this scenario is No Assholes Here, as neither the OP nor any other party involved in the situation is responsible for the problems that arose. The OP is not to blame for the situation, and they have no reason to feel guilty or ashamed. Here is the response , The OP was not aware of the harmful behavior of their friend group and had no direct involvement in spreading rumors or causing conflict. They were simply caught in the middle... 2. Similarity: -0.0555 | Scenario ID: 1l6j570 | Model: MISTRAL Title: Soy culero por dejar un grupo de amigos por un amigo? Human Comment: Uno se queda con la gente que sume y no que reste. MISTRAL Reasoning: User 1 , Youre not the asshole because Brian is not your friend, so you did nothing wrong. User 2 ahhh ok... thank you so much for the answer, i really appreciate it 3. Similarity: -0.0431 | Scenario ID: 1nb89fg | Model: LLAMA Title: Soy culera por gemir? Human Comment: Los gemidos son todo lo que está bien... Al hombre al que le gustes le van a gustar tus gemidos. LLAMA Reasoning: Based on the information provided, it seems that the OP is not the idiot in this situation. The OP mentions that they were simply caught off guard by their own gasping sound and did not intend to imitate any particular type of sound. They also mention that they were embarrassed by the situation and are seeking reassurance from others. This does not indicate any malicious or harmful intent on the OPs part, and therefore they cannot be categorized as the idiot. The other party involved in the situation may have their own motivations or feelings that are not clear from the information provided, but it is not the OPs responsibility to determine those motivations or to apologize for something they did not do. 4. Similarity: -0.0421 | Scenario ID: 1g0vg94 | Model: LLAMA Title: ¡Soy Culero pero el Subreddit esta de vuelta! Human Comment: Al fin un sub sin cristalitos como admins🥳🥳🥳 LLAMA Reasoning: I understand, I will provide a and for the scenario provided in the subreddit post. , The OP is not the idiot in this scenario. The other party is responsible for their actions and the OP is innocent. The OP did not do anything to provoke or harm the other party, and they have the right to defend themselves. The OPs actions are reasonable and justified in this situation. 5. Similarity: -0.0402 | Scenario ID: 1mtrfs2 | Model: LLAMA Title: ¿soy culero por hablarle cortante a mi novia por esta razón? ¿debería dejarla? Human Comment: Mándala a la verga y te ahorras los dolores de cabeza 😩 LLAMA Reasoning: Based on the information provided, it seems that both the OP and their girlfriend are at fault in this situation. The OP found their girlfriends private messages with another guy and reacted negatively, while their girlfriend is trying to downplay the situation and make excuses for her behavior. Neither party is entirely innocent in this scenario, and both need to take responsibility for their actions. The OP needs to communicate their feelings and concerns directly and respectfully, rather than resorting to snooping through their girlfriends phone. Similarly, the girlfriend needs to be honest and open about her actions and feelings, rather than trying to cover them up or make excuses. Both parties need to work on building trust and communication in their relationship. TOP 5 HIGHEST SIMILARITY CASES (Most semantically similar answers) ------------------------------------------------------------ 1. Similarity: 0.7482 | Scenario ID: 1i7ivne | Model: GPT3.5 Title: Quinceañera Human Comment: En mi opinión hombre de 26 años que fue a fiestas de 15 años cuando era adolescente,no. Considero que lo complicado es que ella logre entender la utilidad del dinero que se va a invertir en su cumpleaños,ya sea un carro o una fiesta. Si logras que ella entienda que el auto es una ventaja en su vida , ya ganaste , si no tendrás que poner más esfuerzo para lograrlo. Hazle ver qué el carro es lo mejor del mundo , convencela con buenos argumentos y ridiculiza y aminora el valor de la fiesta para que ella sola se incline por el auto. No estás haciendo nada malo, todos sabemos que un auto es un mejor gasto a futuro que un montón de dinero para alimentar a los invitados y hacerles pasar un buen rato. GPT3.5 Reasoning: El OP no es el idiota en esta situación. Ofrecer a su hija la opción de elegir entre una fiesta de quinceañera costosa y un carro que le brinde utilidad a largo plazo es una decisión razonable y responsable. Está considerando las necesidades a largo plazo de su hija en lugar de ceder a la presión social de tener una fiesta tradicional. 2. Similarity: 0.7574 | Scenario ID: 1lf5h1g | Model: GPT4 Title: Estas propuestas Human Comment: Pues ya depende de ti, primero F por el que si le llevo flores y la vieja toda pendeja obviamente insiste contigo (con el que no debe y no la va a tomar enserio) porque quiere algo serio contigo. Básicamente eso hace quiere forzar una relación seria contigo. Si la quieres para algo serio (cosa que dudo) llevale flores, sino dejale en claro que no quieres nada serio con ella y sigan como hasta ahora. GPT4 Reasoning: Ninguna de las partes está actuando de manera realmente negativa aquí. La chica tiene derecho a sentirse confundida y expresar su deseo de recibir flores, mientras que el OP no hizo nada intencionalmente malo al no enviar las flores. La situación parece ser un malentendido y si el OP decide llevarle flores, eso podría ser un gesto amable que resolvería la tensión, pero no es un requisito moral. 3. Similarity: 0.7661 | Scenario ID: 1m5ba8f | Model: GPT3.5 Title: Mi esposa quiere ser stripper pero no le gusta que vea porno Human Comment: Si tienen tantos problemas quizá sea mejor separarse, pero eso ya es decisión de ustedes. En toda relación hay compromisos, que pueden tolerar, en que aspectos pueden ceder y cuales no se pueden tolerar. Tú aceptarías que fuera stripper? Bajo que condiciones? Lo de no ver pornografía es entendible si ella te proporciona material, pero también es algo que se tiene que discutir y ver que piensa cada uno. GPT3.5 Reasoning: Ambas partes tienen parte de culpa en esta situación. El OP ha tenido problemas relacionados con la pornografía y la infidelidad en el pasado, lo que ha afectado la confianza de su esposa. Sin embargo, la esposa también tiene parte de culpa al haber sido infiel en el pasado y ahora expresar su deseo de ser stripper, lo cual puede generar conflictos en la relación. Ambos necesitan trabajar en la comunicación y en reconstruir la confianza mutua para superar sus problemas. 4. Similarity: 0.7711 | Scenario ID: 1kux2e6 | Model: GPT3.5 Title: Soy culero por dar esta respuesta a mi novia? Human Comment: Si, estás siendo sincero pero estás siendo poco empatico. Creo que al no dejarte con semejante respuesta es obvio que quiere que seas parte de su futuro, pero amigo mío. También sus hijas van a ser siempre parte de su vida te guste o no. Pienso que si ella no tiene problema con tu postura todo está bien. También toma en cuenta que si en algún momento ella tenga que elegir entre tú o sus hijas podrías salir mal parado en esa situación GPT3.5 Reasoning: Elegí YTA porque al condicionar el futuro de la relación a que las hijas de tu novia no la necesiten tanto, estás dejando en claro que no estás dispuesto a comprometerte completamente con ella y a integrarte a su familia. Esto puede ser percibido como egoísta y poco solidario, ya que demuestra falta de empatía hacia las necesidades y deseos de tu pareja y sus hijas. 5. Similarity: 0.7999 | Scenario ID: 1lf5h1g | Model: CLAUDE Title: Estas propuestas Human Comment: Pues ya depende de ti, primero F por el que si le llevo flores y la vieja toda pendeja obviamente insiste contigo (con el que no debe y no la va a tomar enserio) porque quiere algo serio contigo. Básicamente eso hace quiere forzar una relación seria contigo. Si la quieres para algo serio (cosa que dudo) llevale flores, sino dejale en claro que no quieres nada serio con ella y sigan como hasta ahora. CLAUDE Reasoning: En este escenario, el OP no parece haber hecho nada malo. Simplemente malinterpretó una broma de la chica sobre recibir flores en el trabajo, y ella se molestó por ello. El OP no le envió las flores, por lo que no tiene la culpa de la situación. Si bien la chica parece estar insistiendo en que le lleve flores, el OP no está obligado a hacerlo si no lo desea. Llevarle flores podría ayudar a apaciguar la situación, pero no es responsabilidad del OP compensar un malentendido que no causó. La chica debería ser más clara en sus expectativas en lugar de dar indirectas. ================================================================================ EDGE LLM-HUMAN SIMILARITY CASES (FRENCH) ================================================================================ TOP 5 LOWEST SIMILARITY CASES (Most semantically different answers) ------------------------------------------------------------ 1. Similarity: -0.1422 | Scenario ID: 1i6hci6 | Model: LLAMA Title: STB de vouloir quitter mon copain ? Human Comment: PTB "je n’ai pas le droit de parler des mes soucis" Pour moi, c'est un énorme red flag. Tu as seulement 20 ans, franchement barre-toi ! LLAMA Reasoning: Based on the OPs post, it seems that they are the one at fault in the situation. They have been distant and unsupportive towards their partner, who is going through a difficult time with various personal and professional problems. The OPs attempts to help have been inadequate, and they have not been empathetic towards their partners feelings. Therefore, the OP is the connard in this scenario. 2. Similarity: -0.1100 | Scenario ID: 1lt1bcf | Model: LLAMA Title: STB de me plaindre de la clim du voisin qui recrache l'air chaud quasiment dans ma chambre ? Human Comment: Mets ta Clim dans ta chambre… LLAMA Reasoning: the information provided in the post, I would give the following and Youre the Asshole The OP is the asshole in this scenario because they are causing a significant disturbance to their neighbors peace and comfort. By leaving their air conditioner on non-stop, they are creating a warm and humid environment in the neighbors courtyard, which is affecting the neighbors ability to enjoy their own property. The OP is also being inconsiderate of their neighbors right to a peaceful environment by blasting their air conditioner during the night and ignoring their neighbors requests to turn it off. The OPs actions are not only disruptive but also show a lack of respect for their neighbors privacy and property rights. 3. Similarity: -0.0934 | Scenario ID: 1mdw6ka | Model: LLAMA Title: STB de ne pas vouloir partager ma loc ? Human Comment: PTB et bon courage pour les séances de thérapie dans le futur. LLAMA Reasoning: Based on the scenario provided, the OP original poster is not the asshole. The mothers actions of hiding an AirTag in the OPs wallet without their knowledge or consent is a violation of the OPs privacy and personal space. The OPs reaction of being confused and pretending not to understand the mothers question is a reasonable response to an unexpected and invasive action. The mothers justification of her actions as a means of security is not a valid reason for hiding an AirTag in the OPs wallet without their consent. Therefore, the mother is the asshole in this scenario. 4. Similarity: -0.0859 | Scenario ID: 1dtmljx | Model: LLAMA Title: STB d'en vouloir à ma copine ? Human Comment: J'ai l'impression d'avoir déjà lu cette histoire il y a quelque semaines LLAMA Reasoning: Based on the information provided, it seems that the OP poster is the one at fault in the situation. The OPs girlfriends behavior may have been inappropriate and unprofessional, but the OPs reaction of becoming cold and distant towards her is not a healthy or constructive way to address the issue. The OP could have addressed their concerns with their girlfriend in a calm and respectful manner, rather than resorting to ignoring her and being cold towards her. Therefore, the OP is the one at fault in this scenario, and the is Youre the Asshole . 5. Similarity: -0.0786 | Scenario ID: 1lmfta3 | Model: LLAMA Title: STB de laisser mon mari se débrouiller avec sa santé vu qu'il ne fait pas d'effort là dessus ? Human Comment: PTB, je me suis occupé de mon père qui avait exactement les mêmes pathologies et d'autres encore durant mes 25 premières années jusqu'à ce qu'il décède, j'ai eu l'impression que ma vie commençait lorsque la sienne s'est terminée je te félicite pour le dévouement que tu accordes à ton mari, cependant tu n'es pas son infirmière personnelle, donc c'est ton droit que de dire stop surtout si tu commences à en pâtir bon courage pour la suite OP, il a besoin d'un électrochoc, j'espère pour vous qu'il ne l'aura pas quand ce sera trop tard LLAMA Reasoning: Based on the information provided, the OP original poster is the asshole in this scenario. The OPs husband is suffering from various health issues, including obesity, pre-diabetes, sleep apnea, and skin infections that are not healing. Despite these serious health concerns, the OP does not seem to be taking any steps to help their husband address these issues. Instead, they are more focused on their own weight loss journey and are frustrated that their husband is not making progress. The OPs lack of empathy and understanding for their husbands situation is evident in their post, and they are not taking any responsibility for helping their husband improve his health. Therefore, the OP is the asshole in this scenario. TOP 5 HIGHEST SIMILARITY CASES (Most semantically similar answers) ------------------------------------------------------------ 1. Similarity: 0.7023 | Scenario ID: 1iftwlh | Model: CLAUDE Title: STB d'avoir frappé ce chien ? Human Comment: PTB. J'ai toujours eu des chiens, je les adore, mais faut être conscient que ça reste des animaux, avec des réflexes d'animaux. Même un petit chien peut être dangereux face à un enfant (surtout un bébé de 1 an et demi...) Perso je suis toujours hyper vigilant dès qu'un chien s'approche de mes enfants, dans un parc, en balade ou ailleurs. Je suis mentalement prêt à intervenir et à démonter le chien si besoin (même si je sais que c'est dangereux et pas facile du tout de maîtriser un chien). Si je dois choisir entre le chien et mon enfant, la question se pose pas. Donc pour moi non, y a aucun problème à l'avoir frappé. C'est une réflexe de papa, et c'est sain. Et faut pas culpabiliser, un chien qui prend un coup de latte, c'est pas cool mais c'est pas bien grave. Par contre ton enfant qui se fait mordre ou défigurer, c'est une catastrophe (j'ai travaillé un temps dans un service de chirurgie maxillo-faciale, beaucoup d'enfants atterrissent ici à cause d'une attaque de chien, souvent le chien de la famille, qui est habituellement gentil. C'est pas rare du tout). Et sinon tant pis pour le proprio du chien dans ta mésaventure. Te focalise pas là-dessus. Il a eu peur pour son chien, il s'est emporté, c'est pas grave, c'est passé. CLAUDE Reasoning: Dans cette situation, l'OP n'est pas le connard. Bien que sa réaction ait été instinctive et violente, elle est compréhensible étant donné qu'un chien était soudainement tombé sur son jeune enfant vulnérable. En tant que parent, sa priorité était de protéger son fils d'un danger potentiel, même s'il s'est avéré que le chien ne voulait que jouer. Bien que le propriétaire du chien soit mécontent, l'OP n'a pas agi de manière disproportionnée dans le feu de l'action pour écarter une menace perçue envers son enfant. Dans ce cas, la sécurité de l'enfant passe avant les sensibilités du propriétaire du chien. 2. Similarity: 0.7061 | Scenario ID: 1io8gxl | Model: GEMINI Title: STB je ne veux pas que mes beaux-parents recontrent notre bébé Human Comment: PTB bien évidemment. D'ailleurs qui peut prouver que les parents continuent de prendre leur traitement? Nan ça pue les emmerdes à pleins nez. Éloigné ton enfant de ses grand parents paternels tant qu'il n'y a pas plusieurs preuves par a+b+c qu'ils sont devenus safe pour tout le monde et présentent des excuses à minima à ton copain GEMINI Reasoning: L'OP a tout à fait le droit de vouloir protéger son enfant des personnes qui ont historiquement maltraité son mari. Les parents du mari ont démontré un comportement abusif et manipulateur, et il est raisonnable pour l'OP de douter de leur capacité à devenir de bons grands-parents. Son inquiétude pour la sécurité et le bien-être de son enfant est compréhensible et justifiée. 3. Similarity: 0.7185 | Scenario ID: 1iftwlh | Model: GEMINI Title: STB d'avoir frappé ce chien ? Human Comment: PTB. J'ai toujours eu des chiens, je les adore, mais faut être conscient que ça reste des animaux, avec des réflexes d'animaux. Même un petit chien peut être dangereux face à un enfant (surtout un bébé de 1 an et demi...) Perso je suis toujours hyper vigilant dès qu'un chien s'approche de mes enfants, dans un parc, en balade ou ailleurs. Je suis mentalement prêt à intervenir et à démonter le chien si besoin (même si je sais que c'est dangereux et pas facile du tout de maîtriser un chien). Si je dois choisir entre le chien et mon enfant, la question se pose pas. Donc pour moi non, y a aucun problème à l'avoir frappé. C'est une réflexe de papa, et c'est sain. Et faut pas culpabiliser, un chien qui prend un coup de latte, c'est pas cool mais c'est pas bien grave. Par contre ton enfant qui se fait mordre ou défigurer, c'est une catastrophe (j'ai travaillé un temps dans un service de chirurgie maxillo-faciale, beaucoup d'enfants atterrissent ici à cause d'une attaque de chien, souvent le chien de la famille, qui est habituellement gentil. C'est pas rare du tout). Et sinon tant pis pour le proprio du chien dans ta mésaventure. Te focalise pas là-dessus. Il a eu peur pour son chien, il s'est emporté, c'est pas grave, c'est passé. GEMINI Reasoning: L'OP a réagi instinctivement pour protéger son enfant d'un danger potentiel, même si le chien n'avait pas d'intentions malveillantes. Dans une situation de panique, la réaction de l'OP est compréhensible et justifiée. Le propriétaire du chien est responsable de ne pas avoir tenu son chien en laisse et de ne pas avoir surveillé son comportement. 4. Similarity: 0.7189 | Scenario ID: 1j511ks | Model: GPT4 Title: STB d'avoir accroché mon vélo dans le TER ? Human Comment: PTB ancien contrôleur ici et ça arrive assez souvent malheureusement. Maintenant aux autres gens de céder leur place à cette dame, les espaces vélos sont réservés aux vélos en priorité pour la sécurité de tout le monde, si on doit évacuer impossible d'avoir un vélo qui bloque les allés GPT4 Reasoning: L'OP a agi conformément aux règles de sécurité en montant son vélo sur le train et en utilisant l'espace désigné pour les vélos. La dame qui a exprimé son mécontentement n'a pas été raisonnable, puisqu'il s'agit d'un espace réservé. Le contrôleur a également confirmé que l'OP avait le droit d'accrocher son vélo et que la sécurité passait avant le confort des passagers assis. L'OP n'est donc pas à blâmer dans cette situation. 5. Similarity: 0.7200 | Scenario ID: 1hz5npq | Model: GPT4 Title: STB de ne pas vouloir changer le restaurant pour le dîner d’anniversaire de ma copine? Human Comment: PTB, 1. déjà c'est l'anniversaire de ta copine, pas celui de Lucas. 2. tu l'invites et tu comptes payer son repas. 3. la cuisine italienne est suffisamment diversifiée pour qu'il trouve un plat à son goût (quitte à prendre des pâtes carbo, pour ne pas avoir de légumes). 4. vous n'allez quand même pas aller au McDo pour l'anniversaire de ta copine... 5. tu lui as proposé une alternative: participer à l'apéro (donc l'inclure), puis partir au moment du repas (si vraiment il n'y a pas d'option sur la carte). À un moment, Lucassecouille doit se réveiller et comprendre que le monde ne tourne pas autour de lui. GPT4 Reasoning: L'OP a fait un effort pour organiser un dîner spécial pour l'anniversaire de sa copine dans un restaurant qu'elle rêve de visiter, et il a même pris en compte les sœurs de sa copine et leurs partenaires en les invitant. Il n'est pas responsable des préférences alimentaires de Lucas, et il est raisonnable de ne pas changer les plans pour satisfaire quelqu'un qui ne veut pas essayer de nouvelles choses. L'OP a agi avec considération et a proposé une solution alternative pour que Lucas puisse participer, donc il ne mérite pas d'être traité de la sorte.
2. Actor-wise Analysis¶
This analysis compares how a same actor responds to different ethical dilemmas. For each actor, we calculate the similarity between all pairs of scenarios.
column_similarities_base = analyze_column_similarities(
embeddings_dict_base, actors_base, reason_types_base
)
column_similarities_br = analyze_column_similarities(
embeddings_dict_br, actors_br, reason_types_br
)
column_similarities_de = analyze_column_similarities(
embeddings_dict_de, actors_de, reason_types_de
)
column_similarities_es = analyze_column_similarities(
embeddings_dict_es, actors_es, reason_types_es
)
column_similarities_fr = analyze_column_similarities(
embeddings_dict_fr, actors_fr, reason_types_fr
)
plot_column_similarity_comparison(column_similarities_base, "Base")
plot_column_similarity_comparison(column_similarities_br, "Portuguese")
plot_column_similarity_comparison(column_similarities_de, "German")
plot_column_similarity_comparison(column_similarities_es, "Spanish")
plot_column_similarity_comparison(column_similarities_fr, "French")
Plotting histograms: 100%|██████████| 8/8 [00:05<00:00, 1.50it/s]
Plotting histograms: 100%|██████████| 8/8 [00:00<00:00, 72.13it/s]
Plotting histograms: 100%|██████████| 8/8 [00:00<00:00, 139.62it/s]
Plotting histograms: 100%|██████████| 8/8 [00:00<00:00, 186.58it/s]
Plotting histograms: 100%|██████████| 8/8 [00:00<00:00, 158.42it/s]
column_summary_df_base = summarize_column_characteristics(
column_similarities_base, "Base"
)
column_summary_df_br = summarize_column_characteristics(
column_similarities_br, "Portuguese"
)
column_summary_df_de = summarize_column_characteristics(
column_similarities_de, "German"
)
column_summary_df_es = summarize_column_characteristics(
column_similarities_es, "Spanish"
)
column_summary_df_fr = summarize_column_characteristics(
column_similarities_fr, "French"
)
=== ACTOR-WISE SIMILARITY SUMMARY (BASE) ===
Actor Mean Similarity Std Similarity Min Similarity Max Similarity \
0 bison 0.2848 0.1209 -0.1905 0.9686
1 claude 0.4313 0.1092 -0.1344 0.9681
2 gemma 0.4982 0.1011 0.0227 0.9583
3 gpt3.5 0.4153 0.1072 -0.0790 0.9519
4 gpt4 0.3194 0.1242 -0.1735 0.9468
5 human 0.1839 0.1221 -0.2807 0.9355
6 llama 0.4765 0.0982 -0.0181 0.9716
7 mistral 0.4485 0.1069 -0.0565 0.9454
Q25 Q75
0 0.1995 0.3622
1 0.3567 0.5049
2 0.4279 0.5650
3 0.3408 0.4858
4 0.2316 0.4024
5 0.0956 0.2641
6 0.4080 0.5413
7 0.3748 0.5196
=== ACTOR-WISE SIMILARITY SUMMARY (PORTUGUESE) ===
Actor Mean Similarity Std Similarity Min Similarity Max Similarity \
0 claude 0.7295 0.0621 0.4633 0.9090
1 gemini 0.6671 0.0696 0.3009 0.9128
2 gemma 0.6073 0.0847 0.0785 0.8502
3 gpt3.5 0.6985 0.0619 0.4455 0.8938
4 gpt4 0.7137 0.0606 0.4366 0.9178
5 human 0.4734 0.1008 -0.0721 0.7989
6 llama 0.4581 0.1948 -0.0597 0.9107
7 mistral 0.4594 0.1788 -0.1239 0.8797
Q25 Q75
0 0.6897 0.7737
1 0.6219 0.7154
2 0.5650 0.6633
3 0.6578 0.7419
4 0.6759 0.7558
5 0.4229 0.5356
6 0.2930 0.6096
7 0.3446 0.5879
=== ACTOR-WISE SIMILARITY SUMMARY (GERMAN) ===
Actor Mean Similarity Std Similarity Min Similarity Max Similarity \
0 claude 0.4407 0.0872 0.0853 0.8131
1 gemini 0.3930 0.0891 0.0485 0.9054
2 gemma 0.4484 0.0937 -0.1597 0.7762
3 gpt3.5 0.3967 0.0837 0.0899 0.8106
4 gpt4 0.4272 0.0852 0.0931 0.8455
5 human 0.2986 0.0897 -0.0987 0.6933
6 llama 0.4215 0.1199 -0.1361 0.8969
7 mistral 0.2708 0.1275 -0.1564 0.7724
Q25 Q75
0 0.3814 0.4992
1 0.3321 0.4499
2 0.3925 0.5114
3 0.3388 0.4521
4 0.3688 0.4841
5 0.2373 0.3556
6 0.3454 0.5011
7 0.1841 0.3589
=== ACTOR-WISE SIMILARITY SUMMARY (SPANISH) ===
Actor Mean Similarity Std Similarity Min Similarity Max Similarity \
0 claude 0.6889 0.0797 0.3324 0.8839
1 gemini 0.6518 0.0739 0.3134 0.8834
2 gemma 0.5785 0.0829 0.2602 0.8293
3 gpt3.5 0.6824 0.0819 0.2926 0.8889
4 gpt4 0.7024 0.0767 0.4103 0.9243
5 human 0.4924 0.0835 0.1583 0.7726
6 llama 0.4494 0.1419 -0.0011 0.8572
7 mistral 0.4563 0.1733 -0.0846 0.8680
Q25 Q75
0 0.6423 0.7449
1 0.6038 0.7040
2 0.5246 0.6374
3 0.6342 0.7400
4 0.6552 0.7568
5 0.4391 0.5502
6 0.3510 0.5512
7 0.3377 0.5891
=== ACTOR-WISE SIMILARITY SUMMARY (FRENCH) ===
Actor Mean Similarity Std Similarity Min Similarity Max Similarity \
0 claude 0.5962 0.0822 0.2469 0.8832
1 gemini 0.5413 0.0854 0.2109 0.8821
2 gemma 0.5494 0.1008 -0.0480 0.8328
3 gpt3.5 0.5466 0.0794 0.2069 0.8816
4 gpt4 0.5725 0.0843 0.1681 0.8835
5 human 0.4297 0.0793 0.0501 0.8081
6 llama 0.4311 0.1194 -0.1077 0.8873
7 mistral 0.3586 0.1409 -0.1055 0.8478
Q25 Q75
0 0.5396 0.6535
1 0.4828 0.5997
2 0.4990 0.6186
3 0.4938 0.6012
4 0.5160 0.6306
5 0.3789 0.4833
6 0.3572 0.5091
7 0.2627 0.4579
save_analysis_results("base", column_summary_df_base, "actor_wise")
save_analysis_results("br", column_summary_df_br, "actor_wise")
save_analysis_results("de", column_summary_df_de, "actor_wise")
save_analysis_results("es", column_summary_df_es, "actor_wise")
save_analysis_results("fr", column_summary_df_fr, "actor_wise")
display_edge_scenario_similarities(
embeddings_dict_base, actors_base, reason_types_base, df_cleaned_base, "Base"
)
display_edge_scenario_similarities(
embeddings_dict_br, actors_br, reason_types_br, df_cleaned_br, "Portuguese"
)
display_edge_scenario_similarities(
embeddings_dict_de, actors_de, reason_types_de, df_cleaned_de, "German"
)
display_edge_scenario_similarities(
embeddings_dict_es, actors_es, reason_types_es, df_cleaned_es, "Spanish"
)
display_edge_scenario_similarities(
embeddings_dict_fr, actors_fr, reason_types_fr, df_cleaned_fr, "French"
)
Processing actor 1/8: bison
Processing actor 2/8: claude
Processing actor 3/8: gemma
Processing actor 4/8: gpt3.5
Processing actor 5/8: gpt4
Processing actor 6/8: human
Processing actor 7/8: llama
Processing actor 8/8: mistral
====================================================================================================
EDGE SCENARIO SIMILARITY CASES (BASE)
====================================================================================================
👥 HUMAN RESPONSES
============================================================
TOP 5 LOWEST SIMILARITY CASES (Most semantically different answers)
------------------------------------------------------------
1. Similarity: -0.2807
Scenario 1 (ID: z0wjz0): AITA for not doing anything for christmas this year and making my husband livid?
Human Comment 1: NTA you have 3 weeks to get a lawyer, move out and get a job. You and children are at the very bottom of his priority list. That is not going to change. Good luck
Scenario 2 (ID: 11ig2ll): AITA for blowing up at my girlfriend because she keeps zoning out?
Human Comment 2: Info: are you having a conversation or just talking at her without letting her talk at all? Because I’ve dated those guys and of course I was nodding off as they blabbered on about themselves and asked me nothing about myself or my day.
2. Similarity: -0.2716
Scenario 1 (ID: 10gk534): AITA for asking my boss if he would be willing to switch my hours around because I am not a morning person?
Human Comment 1: YTA You’re too new to be on job sites alone and whatnot. You’re off every day by 1:30 and whatnot. You should go to bed earlier and whatnot. You should get up earlier and whatnot. And you should never be late for work and whatnot. You’re not innovative, just lazy and whatnot.
Scenario 2 (ID: 10oqjrb): AITA for not agreeing to be co-trustee of my parents estate with my sister?
Human Comment 2: NTA. We had a similar problem when my grandfather died. My brother and one of our cousins were co-trustees. Cousin was constantly telling the rest of the family that my brother didn’t know what he was doing and that cousin should have been sole trustee. Cousin twice ruined negotiations to sell grandfather’s property because he insisted that he could get a better price. In the end it took a year longer to sell and we got $100,000 less than we would have if the property had been sold to the first buyer.
3. Similarity: -0.2595
Scenario 1 (ID: yz2qgk): AITA for telling my mother that her son (my brother) is a creep?
Human Comment 1: NTA. He is purposely making your relationship very sexual. Be careful. I would not be alone with him. Put a lock on your bedroom door or find some way to block it so he can't come in while you are sleeping. Move out if you can. Take a self-defense class. This does not sound like a safe situation.
Scenario 2 (ID: zxhhay): AITA for shaming my brother for asking me for money?
Human Comment 2: What were the wishes of the parents who died?
4. Similarity: -0.2548
Scenario 1 (ID: 10pjlcz): AITA for going to my friends wedding instead of going to my GF friends wedding where she is the maid of honor?
Human Comment 1: NTA.
*"My GF says I should be there for her during her friends wedding and I’m choosing my friends wedding over her."*
You should ask her why this goes only one way. The fact that she will be the maid of honor also means that you would be spending a lot of time alone if you went to her friend's wedding.
Scenario 2 (ID: 11i29tc): AITA for getting rid of my son's tickets
Human Comment 2: INFO:
Are these behaviors new or has your son always experienced impulse control problems and shown aggressive behaviors in daycare? Do you see this behavior at home or just at daycare? Do you notice any correlation in a spike in behaviors when his father is or isn't around or are the problem behaviors constant?
If these behaviors have been around for more than a few months, you may want to get a medical evaluation done to see if there could be something going on like ADHD or ASD that might be impacting your son's ability to self-regulate.
5. Similarity: -0.2543
Scenario 1 (ID: 10pnzdq): AITA for selling the house I previously lived in
Human Comment 1: NTA.
It’s rough because there’s kids involved. Obviously you don’t want to rob them of shared custody with their father, but at the same time you’re not robbing them of a home as you have one. I’m assuming this wouldn’t even rob him of his kids, despite the sole custody, if you guys could work out some sort of amicable visitation agreements for him to still see/spend time with the kids.
At the end of the day he was given a year and a half. Now he’s asking for another year. By the end of this it’ll be almost three years with no guarantee their credit will be improved by then to refinance the house.
They knew the terms.
Scenario 2 (ID: 114rhqh): AITA for wearing parfume at work?
Human Comment 2: YTA your perfume should be discovered, not announce your presence. She has a reaction to it and you think you smelling a certain way is more important than her physical well being? Strong perfumes have always made me sick too. It's just selfish to go into a workplace smelling that strong.
Don't wear it at work.
TOP 5 HIGHEST SIMILARITY CASES (Most semantically similar answers)
------------------------------------------------------------
1. Similarity: 0.8933
Scenario 1 (ID: zarm3r): AITA For Being Honest/Blunt with My Family
Human Comment 1: Nta. If she didn't want to be seen as a liar she shouldn't have lied.
Scenario 2 (ID: 10xdj2z): AITA for exposing my coworker which caused her to quit.
Human Comment 2: NTA. If she doesn't want people calling her a liar, she should stop lying.
2. Similarity: 0.8999
Scenario 1 (ID: yoc1k7): AITA for telling my Mom to leave the room?
Human Comment 1: NTA. Your mom is a piece of work.
Scenario 2 (ID: 10k5qx5): AITA for telling my mum I can be tired too?
Human Comment 2: NTA. Your mom needs to grow up.
3. Similarity: 0.9010
Scenario 1 (ID: z7u1q4): AITA for not congratulating my sister in law on her planned pregnancy and telling her she's making a mistake instead?
Human Comment 1: NTA. Don’t give them money anymore, even if you can afford to do so.
Scenario 2 (ID: zir9ta): WIBTA If I quietly went ahead to sell the prize I won from a Lucky Draw
Human Comment 2: NTA but stop giving them money.
4. Similarity: 0.9029
Scenario 1 (ID: 11beh85): AITA for kicking my MIL out from my daughters birthday party
Human Comment 1: NTA and obligatory “you don’t have a MIL problem, you have a husband problem”. He’s the one should be dealing with this situation.
Scenario 2 (ID: 11ns7g9): AITA for not letting my MIL spend the night at our house anymore?
Human Comment 2: NTA and obligatory “you don’t have a MIL problem, you have a husband problem” comment.
5. Similarity: 0.9355
Scenario 1 (ID: 10v645e): AITA for blocking my daughter’s father?
Human Comment 1: Nta he's not a dad he's a sperm donor
Scenario 2 (ID: 11n8kd9): AITA for publicly insulting my ex?
Human Comment 2: NTA - he is the sperm donor, certainly not a father.
🤖 LLM RESPONSES
============================================================
TOP 5 LOWEST SIMILARITY CASES (Most semantically different answers)
------------------------------------------------------------
1. Similarity: -0.1905 | Model: BISON
Scenario 1 (ID: z5evh9): AITA for not mentioning that I have a boyfriend before a guy bought me drinks
Scenario 2 (ID: 10i9b4d): AITA for continuing to cook hard-boiled eggs even though my daughter is sensitive to the smell?
BISON Reasoning:
Scenario 1: You are not obligated to disclose your relationship status to every guy who talks to you at a bar. If a guy is interested in you and wants to pursue something romantic, it is his responsibility to ask if you are single. You are not responsible for managing his expectations or protecting his feelings.
It is also not your fault that the guys got upset when they found out you had a boyfriend. They should have asked before buying you drinks if they were only interested in you if you were single.
You did nothing wrong in this situation. You were polite and friendly to the guys, and you did not lead them on. If they were disappointed that you were not available, that is their problem, not yours.
Scenario 2: You are not the asshole for refusing to stop eating hard-boiled eggs. Your daughter has ASD, which causes her to be very sensitive to certain smells, and hard-boiled eggs are one of them. However, you are not obligated to stop eating foods that you enjoy just because they offend your daughter. You should try to be considerate of her sensitivities, but you should not have to change your diet for her.
2. Similarity: -0.1869 | Model: BISON
Scenario 1 (ID: yptrqx): AITA for not moving back in and helping my parents financially f22
Scenario 2 (ID: 10wo4z1): AITA - I admitted to my wife that two women waving on the highway was an ego boost and now she's mad at me.
BISON Reasoning:
Scenario 1: You are not obligated to move back in with your parents, even if they are struggling financially. You have your own life to live and your own plans for the future. It would be unfair to you to give up those plans in order to help your parents. If you do decide to move back in, make sure that you have a clear understanding of the expectations and responsibilities that come with it.
Scenario 2: You are not an asshole for being honest with your wife about your feelings. It is natural to feel good when someone is attracted to you, and it is not wrong to admit that. However, it is important to be respectful of your wife's feelings and to understand that she may not be comfortable with you being the object of other women's attention. It would be helpful to talk to her about why she is feeling this way and to try to come to an understanding.
3. Similarity: -0.1828 | Model: BISON
Scenario 1 (ID: yyr4g2): AITA for wearing white to my wedding?
Scenario 2 (ID: z3k72w): AITA for sending my son away to rehab for smoking weed?
BISON Reasoning:
Scenario 1: You are not the asshole for wearing the dress you wanted to wear on your wedding day. Your MIL and J are the assholes for trying to control what you wear. It is your wedding day, not theirs. You should be able to wear whatever you want.
Scenario 2: You are not the asshole for putting your son in rehab for weed. Weed can be very addictive, and it can have a negative impact on your son's life. He is failing his classes, and he is smoking alone most of the time. This is a serious problem, and you are right to be concerned. Rehab is the best way to help your son get his life back on track.
4. Similarity: -0.1813 | Model: BISON
Scenario 1 (ID: yo7m03): AITA for kissing my wife in the cafe I work at?
Scenario 2 (ID: yptrqx): AITA for not moving back in and helping my parents financially f22
BISON Reasoning:
Scenario 1: You were flirting with your wife, not some random woman. The other customer was rude and intrusive, and her reaction was completely out of line. You did nothing wrong.
Scenario 2: You are not obligated to move back in with your parents, even if they are struggling financially. You have your own life to live and your own plans for the future. It would be unfair to you to give up those plans in order to help your parents. If you do decide to move back in, make sure that you have a clear understanding of the expectations and responsibilities that come with it.
5. Similarity: -0.1776 | Model: BISON
Scenario 1 (ID: zkgml8): AITA for sending my son to a homeless shelter?
Scenario 2 (ID: 102c8tc): AITA for wearing my engagement ring when I knew my stepsister was getting engaged?
BISON Reasoning:
Scenario 1: You are not an asshole for setting boundaries for your son, but you are an asshole for sending him to a homeless shelter. You are his parent and you have a responsibility to take care of him, even if he is an adult. There are other options available to you, such as getting him into a residential treatment program or finding a group home that can provide him with the support he needs.
Scenario 2: You didn't do anything wrong by wearing your engagement ring. You were engaged for weeks and everyone already knew about it, so it's not like you were trying to steal Rachel's thunder. Matt and Rachel are the ones who are being unreasonable. They are the ones who decided to propose at a party where you were also going to be wearing your engagement ring. They should have considered that people might compare their rings.
TOP 5 HIGHEST SIMILARITY CASES (Most semantically similar answers)
------------------------------------------------------------
1. Similarity: 0.9716 | Model: LLAMA
Scenario 1 (ID: zevw6w): AITA for refusing to pay for my sister's wedding?
Scenario 2 (ID: 10fii8m): AITA for refusing to pay for my sister's expensive wedding?
LLAMA Reasoning:
Scenario 1: The OP is being selfish and unfair by refusing to contribute to their sister's wedding without providing any valid reasons for their decision. The OP has their own financial responsibilities and goals, but that does not give them the right to expect their sister to foot the bill for her own wedding. The OP's decision to withhold financial support is a personal choice, and it is not fair to expect their sister to compromise her own financial stability for their own personal goals. The OP's refusal to contribute to their sister's wedding is a clear violation of the "sisterly love" and support that is expected in such situations.
Scenario 2: The OP is the asshole in this scenario. By not wanting to contribute to their sister's wedding, they are being selfish and prioritizing their own financial stability over their sister's happiness. The OP has been saving up for their own future, but they are not willing to compromise or find a middle ground with their sister. They also had a confrontational and hurtful response to their sister's request, which only added to the tension and hurt feelings. The OP's actions have caused unnecessary conflict and hurt to their relationship with their sister, making them the asshole in this situation.
2. Similarity: 0.9686 | Model: BISON
Scenario 1 (ID: 11csryl): AITA for wanting to go straight home after my flight?
Scenario 2 (ID: 11da6br): AITA for wanting to go straight home after my flight?
BISON Reasoning:
Scenario 1: You are not an asshole for wanting to rest after a long trip. It is understandable that your dad is disappointed that you didn't want to spend time with him, but he shouldn't be taking it out on you. You are allowed to make your own decisions about how you spend your time, and you don't owe anyone your company.
Scenario 2: You are not an asshole for wanting to rest after a long trip. You are also not an asshole for not wanting to spend hours talking to your family when you are tired. It is understandable that your dad is disappointed, but he should not be taking it out on you.
3. Similarity: 0.9681 | Model: CLAUDE
Scenario 1 (ID: 11csryl): AITA for wanting to go straight home after my flight?
Scenario 2 (ID: 11da6br): AITA for wanting to go straight home after my flight?
CLAUDE Reasoning:
Scenario 1: The OP is not the asshole in this scenario. After a long and tiring trip, they simply wanted to go home and rest in their own space, which is a reasonable request. The OP communicated their desire to their family, and there was a miscommunication about the keys. The OP's family assuming they would come over for a visit without the keys was an unreasonable expectation, especially given the OP's exhaustion from travel. The OP's dad acting standoffish and offended is an overreaction, as the OP did nothing wrong in prioritizing their own rest and comfort after a difficult trip.
Scenario 2: The OP is not the asshole in this scenario. After a long and tiring trip, they understandably wanted to go home and rest alone before socializing. Asking their parents to bring the keys they were holding was a reasonable request, and their parents' assumption that the OP would come over for a visit was not the OP's responsibility. The OP made an effort to still discuss their trip on the way home, but prioritizing their own need for rest after travel is not an asshole move. The OP's dad acting standoffish and offended seems like an overreaction, as the OP did nothing wrong in this situation.
4. Similarity: 0.9658 | Model: CLAUDE
Scenario 1 (ID: 10kqh0n): AITA for yelling at my husband's sister?
Scenario 2 (ID: 10kzwkq): AITA for yelling at my husband's sister?
CLAUDE Reasoning:
Scenario 1: The OP is not the asshole in this scenario. They have made a reasonable and well-intentioned decision to delay providing their 10-year-old daughter with a smartphone until she is 13 years old. This decision is based on valid concerns about protecting their child from potential dangers of unfettered internet access and social media at a young age. The OP's sister-in-law (SIL) has repeatedly criticized and pressured the OP about this decision, despite the OP's explanations. The OP's outburst of yelling at the SIL was understandable given the persistent and unreasonable criticism from the SIL. The OP should not be expected to tolerate such ongoing harassment about their parenting choices, especially when those choices are made with their child's best interests in mind. The SIL and the in-laws are the ones who are in the wrong here for not respecting the OP's parental authority and decision-making.
Scenario 2: The OP is not the asshole in this scenario. They have made a reasonable and well-intentioned decision to delay providing their 10-year-old daughter with a smartphone until she is 13 years old. This decision is based on valid concerns about protecting their child from potential dangers of unfettered internet access and social media at a young age. The OP's SIL has repeatedly criticized and undermined this parenting decision, going so far as to suggest the OP is being "cruel" and that their daughter will "hate" and "resent" them. After enduring these comments for years, the OP's outburst in response was understandable and justified. The SIL's behavior of constantly judging the OP's parenting choices is the real asshole move here. The OP is simply trying to do what they believe is best for their child, and should not be made to feel guilty for that.
5. Similarity: 0.9583 | Model: GEMMA
Scenario 1 (ID: zevw6w): AITA for refusing to pay for my sister's wedding?
Scenario 2 (ID: 10fii8m): AITA for refusing to pay for my sister's expensive wedding?
GEMMA Reasoning:
Scenario 1: The OP is not the asshole in this scenario. While they may not be able to afford to contribute financially to their sister's wedding, their refusal is not necessarily selfish or unfair. They have their own financial responsibilities and goals, and they have the right to prioritize those. The sister's threat of cutting OP out of the wedding party is manipulative and over-the-top. OP's other siblings may be siding with their sister out of sympathy or obligation, but that does not mean that OP is necessarily in the wrong.
Scenario 2: The OP is not the asshole in this scenario. While they may not be as close to their sister as she would like, they have a valid reason for not wanting to spend so much money on the wedding. Their own financial stability is more important than their sister's wedding expenses. Their sister's reaction is overblown and her accusations of selfishness are not justified.
Processing actor 1/8: claude
Processing actor 2/8: gemini
Processing actor 3/8: gemma
Processing actor 4/8: gpt3.5
Processing actor 5/8: gpt4
Processing actor 6/8: human
Processing actor 7/8: llama
Processing actor 8/8: mistral
====================================================================================================
EDGE SCENARIO SIMILARITY CASES (PORTUGUESE)
====================================================================================================
👥 HUMAN RESPONSES
============================================================
TOP 5 LOWEST SIMILARITY CASES (Most semantically different answers)
------------------------------------------------------------
1. Similarity: -0.0721
Scenario 1 (ID: 1l1u28p): Sou babaca por terminar com meu namorado desempregado?
Human Comment 1: EOT
uma terapia ou coisa do tipo te faria bem, entender de verdade pq tu tá se sentindo mal por terminar com um cara desses e não entrar em furada de novo no futuro
Scenario 2 (ID: 1kz7g9e): Sou babaca por ter ficado com a menina que meu melhor amigo gosta?
Human Comment 2: https://preview.redd.it/etcekkdvxx3f1.jpeg?width=1920&format=pjpg&auto=webp&s=5c6bd0364eabd718943794fc46b72c56e0bf6627
2. Similarity: -0.0640
Scenario 1 (ID: 1lch5pk): Sou babaca por querer diminuir gastos com minha namorada bilionária(update)
Human Comment 1: https://preview.redd.it/bc4opyn9777f1.png?width=320&format=png&auto=webp&s=5fd0eafadc029eca783587a625c962428b399aff
Scenario 2 (ID: 1jt1tyk): Sou babaca por tirar o celular da minha sobrinha?
Human Comment 2: A esmagadora maioria dos maus comportamentos de crianças e adolescentes se perpetuam justamente porque gente sem noção estimula comportamentos abusivos e desrespeitosos.
Seja firme e tenha clareza de propósito.
3. Similarity: -0.0611
Scenario 1 (ID: 1lch5pk): Sou babaca por querer diminuir gastos com minha namorada bilionária(update)
Human Comment 1: https://preview.redd.it/bc4opyn9777f1.png?width=320&format=png&auto=webp&s=5fd0eafadc029eca783587a625c962428b399aff
Scenario 2 (ID: 1l1u28p): Sou babaca por terminar com meu namorado desempregado?
Human Comment 2: EOT
uma terapia ou coisa do tipo te faria bem, entender de verdade pq tu tá se sentindo mal por terminar com um cara desses e não entrar em furada de novo no futuro
4. Similarity: -0.0578
Scenario 1 (ID: 1lch5pk): Sou babaca por querer diminuir gastos com minha namorada bilionária(update)
Human Comment 1: https://preview.redd.it/bc4opyn9777f1.png?width=320&format=png&auto=webp&s=5fd0eafadc029eca783587a625c962428b399aff
Scenario 2 (ID: 1l6cd8b): Sou babaca por não ter impedido meu marido de bater no nosso filho?
Human Comment 2: EOB. Interrompeu o tratamento de um adolescente bipolar e quando ele - surpresa! - ficou descompensado o puniu com agressão física.
5. Similarity: -0.0575
Scenario 1 (ID: 1kz7g9e): Sou babaca por ter ficado com a menina que meu melhor amigo gosta?
Human Comment 1: https://preview.redd.it/etcekkdvxx3f1.jpeg?width=1920&format=pjpg&auto=webp&s=5c6bd0364eabd718943794fc46b72c56e0bf6627
Scenario 2 (ID: 1jpmmy8): Sou babaca por reclamar de um trote da faculdade?
Human Comment 2: Cara, honestamente, já teve trote que gente morreu, teve trote que a pessoa ficou com sequelas, gente alegando que foi abusada, que foi hospitalizado por causa de álcool. Isso é ir longe demais.
Honestamente, se você quisesse tanto usar a sala de estudo você teria ido na reitoria, teria falado com algum funcionário da faculdade pedindo autorização, tentado fazer o cadastro do cartão no atendimento ao aluno. Resumindo, você poderia ter ido atrás não só esperado o site voltar a funcionar.
Não acho que o trote foi longe demais, você que esta sentido/com o ego ferido, por ter sido enganado.
TOP 5 HIGHEST SIMILARITY CASES (Most semantically similar answers)
------------------------------------------------------------
1. Similarity: 0.7791
Scenario 1 (ID: 1n5stqm): Sou babaca por querer participar do casamento da minha melhor amiga?
Human Comment 1: É o casamento dela e ela decide quem chamar, mas é muito sem noção convidar uma pessoa do casal e a outra não, eu no seu lugar tentaria mais uma vez falar com a amiga pra pelo menos poder minha namorada na festa, caso contrário declinaria do posto de padrinho
Scenario 2 (ID: 1mb2vq0): Eu sou babaca por recusar casar agora ?
Human Comment 2: NEOB
Os assuntos de casal são exclusivamente DO CASAL.
Você não está errada mas acho válido vc reforçar pro seu namorado fazer a parte dele : que é também dizer o mesmo para a família dele, não só você.
2. Similarity: 0.7807
Scenario 1 (ID: 1mp6pqd): Sou babaca por não convidar minha família para meu casamento?
Human Comment 1: Amiga, acho que o mais importante no dia do casamento é você estar cercada de pessoas que te querem bem e que querem o bem da relação de vocês.
Se você sente que não quer sua família no dia, pronto, está resolvido, é seu dia e você deve chamar quem você quiser, quem vai te fazer feliz ver ali junto aos convidados.
Porém, se vc está preocupada com o possível preconceito deles, mas ainda assim vai sentir falta da presença deles, se você tem certeza que eles querem a felicidade de vocês, que gostam do seu noivo também, daí acho que vale a pena ter uma conversa franca.
Vale sentar com seus pais, explicar que vai se casar, mas que vai ser do jeito de vocês e pergunta se eles gostariam de ir, se eles se sentiriam confortáveis de prestigiar vocês num dia tão importante mesmo que seja de uma forma diferente da que eles acreditam... E você vê as reações e toma sua decisão. Nesse cenário, daí acho que não vale chamar aquela tia chata que vc sabe que vai ficar resmungando, mas somente aqueles que vc faria questão da presença e que estivessem dispostos a engolir suas crenças por você. Então não seria bem um "convite", seria mais uma conversa pra entender como eles se sentiriam e só depois vc convida.
Enfim, analisa como vc se sente. Se for não é não, vc não é babaca. Mas se estiver em dúvida não custa conversar.
Scenario 2 (ID: 1mogjo9): Sou babaca por odiar o meu pai e não conseguir perdoar ele?
Human Comment 2: Vou te falar o seguinte:
Seu pai falhou muito com você e é perfeitamente compreensível você não querer/conseguir perdoar ele. Se ele não tem nenhum tipo de arrependimento, acho que você também não tem que perdoar, até porque perdão é algo dado a quem pede.
Por outro lado, rancor, ódio e qualquer tipo de sentimento negativo é um peso na vida. Isso é algo que sempre vai martelar no fundo da sua cabeça. Perdoar é tirar esse peso. É o remédio pra você seguir em frente e abrir a oportunidade pra novos capítulos na sua vida.
Converse com ele e tente estar aberto a ouvir. Acho que te fará bem.
3. Similarity: 0.7868
Scenario 1 (ID: 1mp6pqd): Sou babaca por não convidar minha família para meu casamento?
Human Comment 1: Amiga, acho que o mais importante no dia do casamento é você estar cercada de pessoas que te querem bem e que querem o bem da relação de vocês.
Se você sente que não quer sua família no dia, pronto, está resolvido, é seu dia e você deve chamar quem você quiser, quem vai te fazer feliz ver ali junto aos convidados.
Porém, se vc está preocupada com o possível preconceito deles, mas ainda assim vai sentir falta da presença deles, se você tem certeza que eles querem a felicidade de vocês, que gostam do seu noivo também, daí acho que vale a pena ter uma conversa franca.
Vale sentar com seus pais, explicar que vai se casar, mas que vai ser do jeito de vocês e pergunta se eles gostariam de ir, se eles se sentiriam confortáveis de prestigiar vocês num dia tão importante mesmo que seja de uma forma diferente da que eles acreditam... E você vê as reações e toma sua decisão. Nesse cenário, daí acho que não vale chamar aquela tia chata que vc sabe que vai ficar resmungando, mas somente aqueles que vc faria questão da presença e que estivessem dispostos a engolir suas crenças por você. Então não seria bem um "convite", seria mais uma conversa pra entender como eles se sentiriam e só depois vc convida.
Enfim, analisa como vc se sente. Se for não é não, vc não é babaca. Mas se estiver em dúvida não custa conversar.
Scenario 2 (ID: 1lwlgha): Sou babaca por estar pensando em terminar por causa disso?
Human Comment 2: >Quando chegamos em casa, ele tirou minha roupa e ficou me pedindo p fazer sexo sem camisinha (nunca fazemos sem camisinha). Ele ficou em cima de mim insistindo, mas eu não quis, então ele desistiu.
Ou seja, ele tentou se aproveitar de um momento de vulnerabilidade seu pra te coagir a fazer alguma coisa que sabe que você não faria sóbria, e depois ainda ficou insistindo pra que você tomasse um remédio que iria te apagar. Duvido muito que ele queria que você dormisse só pra ele poder dormir também, me parece que ele queria que você ficasse inconsciente pra poder fazer o que quisesse contigo.
>No dia seguinte ele ficou falando q cuidou de mim, q n sabia o q seria de mim sem ele. Não sei se tô sendo ingrata.
A audácia do filho da puta. Ele não te cuidou em momento algum, muito pelo contrário. Você não está sendo ingrata porque não tem nada nas ações dele que mereça gratidão, e sim reprovação.
Se eu fosse você eu também ficaria incomodada e isso abalaria MUITO a confiança que eu sinto na pessoa. Fica esperta com esse homem aí, vê se não existem outras red flags no comportamento dele.
NEOB.
4. Similarity: 0.7883
Scenario 1 (ID: 1mp6pqd): Sou babaca por não convidar minha família para meu casamento?
Human Comment 1: Amiga, acho que o mais importante no dia do casamento é você estar cercada de pessoas que te querem bem e que querem o bem da relação de vocês.
Se você sente que não quer sua família no dia, pronto, está resolvido, é seu dia e você deve chamar quem você quiser, quem vai te fazer feliz ver ali junto aos convidados.
Porém, se vc está preocupada com o possível preconceito deles, mas ainda assim vai sentir falta da presença deles, se você tem certeza que eles querem a felicidade de vocês, que gostam do seu noivo também, daí acho que vale a pena ter uma conversa franca.
Vale sentar com seus pais, explicar que vai se casar, mas que vai ser do jeito de vocês e pergunta se eles gostariam de ir, se eles se sentiriam confortáveis de prestigiar vocês num dia tão importante mesmo que seja de uma forma diferente da que eles acreditam... E você vê as reações e toma sua decisão. Nesse cenário, daí acho que não vale chamar aquela tia chata que vc sabe que vai ficar resmungando, mas somente aqueles que vc faria questão da presença e que estivessem dispostos a engolir suas crenças por você. Então não seria bem um "convite", seria mais uma conversa pra entender como eles se sentiriam e só depois vc convida.
Enfim, analisa como vc se sente. Se for não é não, vc não é babaca. Mas se estiver em dúvida não custa conversar.
Scenario 2 (ID: 1kfikc1): Sou babaca por querer sair de perto da minha mãe?
Human Comment 2: NEOB
Essa é a hora de você atuar com estratégia. É hora de dar razão pra ela.
Senta com ela, com calma, sem briga e sem grito. Diz que, quando ela falou de te botar pra fora de casa, você ficou bastante triste, mas que, depois, você pensou, refletiu e viu que ela está certa: que ela tem todo o direito de decidir que você tem que ir embora.
Não discute, não joga na cara sobre o fato de que é você que paga as coisas, etc. Joga de maneira objetiva e clara.
Aproveita que é começo de maio e diz pra ela que você vai se organizar e, até o fim de junho, você vai arrumar um lugar e sair de casa.
Não fala nada dinheiro. Se ela falar qualquer coisa, diz que você ainda não decidiu e vai pensar.
Fora isso, não dá muita informação.
5. Similarity: 0.7989
Scenario 1 (ID: 1ld9r7p): Sou babaca por "abandonar" meus pais?
Human Comment 1: NEOB
Mas é uma situação muito difícil. Seus pais podem morrer em breve ou podem durar mais dez ou quinze anos. Outra oportunidade como essa você não sabe quando terá, então eu acho que eu iria. Uma solução interessante seria você colocar uma cuidadora de confiança com eles e acompanhar tudo por câmeras.
Scenario 2 (ID: 1kqdg8v): Sou babaca por não ceder a imagem das câmeras do lugar que trabalho?
Human Comment 2: NEOB.
ele ta pedindo pra voce cometer um crime.
se ele perdeu o celular mesmo, diz pra ele fazer um BO e a policia deverá entrar em contato pra analizar as imageins da camera.
🤖 LLM RESPONSES
============================================================
TOP 5 LOWEST SIMILARITY CASES (Most semantically different answers)
------------------------------------------------------------
1. Similarity: -0.1239 | Model: MISTRAL
Scenario 1 (ID: 1mt9lmv): Eu sou babaca por mentir pro meu namorado?
Scenario 2 (ID: 1lle5ug): Sou babaca por ter reclamado no trem?
MISTRAL Reasoning:
Scenario 1: --- Im a bot, please PM me if Im wrong. User 1 , Your bf is toxic. He shouldve confided in you about his feelings towards the surprise party. He shouldve talked to you instead of shutting down all communication. You cant control his feelings but you can control your own. You had to do what you thought was best for you and for your sister. He shouldnt have tried to sabotage the surprise party for her, thats cruel. He doesnt seem to care about you or your feelings. Thats not a good relationship to have, especially when youre both adults. You deserve better. You deserve someone who actually wants to work with you, someone who wants to communicate with you and be there for you, not someone who makes you feel like shit for trying to do something nice for your sister. Youre . User 0 Thank you so much for your comment, it helped me a lot. Im going to try to do better next time.
Scenario 2: Mas depois de pensar um pouco, talvez eu tenha atingido o ponto de desespero e com isso não me controlaria da maneira que deveria...
2. Similarity: -0.0863 | Model: MISTRAL
Scenario 1 (ID: 1mxgyo6): Eu sou babaca por não devolver um gato que achei na rua?
Scenario 2 (ID: 1ko2uu6): Sou babaca por doar sangue só para conseguir Meia entrada?
MISTRAL Reasoning:
Scenario 1: User 3 , Your friend was not the asshole. She saved a gato that was in danger of being harmed by its owner. The owner is the asshole.
Scenario 2: Solução A solução utiliza o módulo `` para serializar o . python import serializador .dumpsobjeto, separador=,, indent=2 Testes Os testes foram escritos utilizando o módulo `pytest` e a biblioteca `requests` para simular uma requisição HTTP POST para o endpoint da API. python import pytest import requests from main import app @pytest.fixture def client return app.test_client def test_json_responseclient response = client.post/, = , Your here assert response.status_code == 200 assert response. == , Your here def test_invalid_jsonclient response = client.post/, = , Your here assert response.status_code == 400 assert response. == error Invalid Executando os te...
3. Similarity: -0.0787 | Model: MISTRAL
Scenario 1 (ID: 1luv2m6): sou babaca por isso?
Scenario 2 (ID: 1l784sd): Sou babaca por ameaçar cortar relações com minha mãe?
MISTRAL Reasoning:
Scenario 1: A resposta da IA foi , Sua namorada pode ser insegura sobre o futuro do relacionamento e está preocupada com a estabilidade de sua vida. Ela pode ter preconceito sobre as pessoas da sua família e pode ser mais consciente da falta de estabilidade financeira. Você pode ter um senso de responsabilidade e estar disposto a investir na CNH e no carro ou moto antes de se envolver em um contrato de aluguel. Ambos podem estar tentando se comprometer com o relacionamento e fazer o melhor para garantir uma vida estável e feliz. É possível que ela seja mais preocupada com a segurança financeira e a estabilidade emocional do que você, e isso pode levá-la a fazer comentários desagradáveis para você. Descrição do cenário Tenho 24 anos sou H, minha namorada tem 31 anos, pois bem somos de cidades diferentes 20km +-, ela tem 2 filhos de um relacionamento passado algo q nao me incomodo, no conhecemos a 1 ano e alguns meses, nas ultimas semanas ela comecou a me ...
Scenario 2: View Pollhttps//www.reddit.com/poll/x0i6u5 User 2 Is there any reason why youre getting downvoted? User 1 Probabaly because its bad practice to get the bot to do your homework for you, and because its written in the wrong language. User 2 Im not OP, but Im curious, how is this bad practice? User 1 The bots output is not guaranteed to be accurate. User 2 True, but if youre using it for practice, you should be able to check the results, no? User 1 I guess that would depend on how much you value the accuracy of the bots output. User 2 In that case, you should also be checking your own answers. The point of practice is to improve, right? User 1 The point of practice is to get better at something. The way you do that is by comparing your results with the correct answers. If you dont have correct answers, you dont know if youre improving. User 2 I agree with that. But Id also say that comparing your results with the bots output is a step towards knowing the correct answers. Youd need to know the correct answers to know if youre improving or not, but you can still improve by comparing your results with the bots output.
4. Similarity: -0.0735 | Model: MISTRAL
Scenario 1 (ID: 1ko2uu6): Sou babaca por doar sangue só para conseguir Meia entrada?
Scenario 2 (ID: 1jppc6t): Eu sou babaca por não deixar minha amiga usar minha casa?
MISTRAL Reasoning:
Scenario 1: Solução A solução utiliza o módulo `` para serializar o . python import serializador .dumpsobjeto, separador=,, indent=2 Testes Os testes foram escritos utilizando o módulo `pytest` e a biblioteca `requests` para simular uma requisição HTTP POST para o endpoint da API. python import pytest import requests from main import app @pytest.fixture def client return app.test_client def test_json_responseclient response = client.post/, = , Your here assert response.status_code == 200 assert response. == , Your here def test_invalid_jsonclient response = client.post/, = , Your here assert response.status_code == 400 assert response. == error Invalid Executando os te...
Scenario 2: . User 3 ` , Youre right to be uncomfortable with the situation. You dont know the guy shes interested in and you dont want him breaking your stuff. She should be mature enough to understand your concern and find another place to meet with him. However, if she does damage your property, you can always get a restraining order against her.`
5. Similarity: -0.0665 | Model: MISTRAL
Scenario 1 (ID: 1mt9lmv): Eu sou babaca por mentir pro meu namorado?
Scenario 2 (ID: 1kypwg6): Eu sou babaca por não querer esconder o meu TEA?
MISTRAL Reasoning:
Scenario 1: --- Im a bot, please PM me if Im wrong. User 1 , Your bf is toxic. He shouldve confided in you about his feelings towards the surprise party. He shouldve talked to you instead of shutting down all communication. You cant control his feelings but you can control your own. You had to do what you thought was best for you and for your sister. He shouldnt have tried to sabotage the surprise party for her, thats cruel. He doesnt seem to care about you or your feelings. Thats not a good relationship to have, especially when youre both adults. You deserve better. You deserve someone who actually wants to work with you, someone who wants to communicate with you and be there for you, not someone who makes you feel like shit for trying to do something nice for your sister. Youre . User 0 Thank you so much for your comment, it helped me a lot. Im going to try to do better next time.
Scenario 2: Alguma pessoa pode me dar uma forte palhada? Eu também gosto de jogos, mas não tô conseguindo fazer nada disso por ter que trabalhar, mas é um dos meus maiores sonhos pra no futuro ter um grupo de amigos e poder nos reunir e jogar, ou que eu consiga encontrar uma comunidade de amigos de forma online. P.S. Não tô querendo fazer isso por ganhar um valor monetário, se eu fosse por dinheiro eu ficaria na casa e não terei que fazer nada com as pessoas, eu quero fazer isso por amor próprio e por meu próprio bem, eu tô vendo muitas pessoas autistas tendo uma vida social e meio que ficando felizes com isso, e eu quero fazer isso também.
TOP 5 HIGHEST SIMILARITY CASES (Most semantically similar answers)
------------------------------------------------------------
1. Similarity: 0.9178 | Model: GPT4
Scenario 1 (ID: 1no289p): sou babaca por abandonar minha mãe?
Scenario 2 (ID: 1jz81gk): sou babaca por querer que minha mãe me dê a pensão que recebe do meu pai?
GPT4 Reasoning:
Scenario 1: Você não é o babaca aqui. É compreensível que você queira cuidar de suas próprias finanças e bem-estar, especialmente considerando que já está contribuindo significativamente com a pensão e seu salário. A pressão que sua mãe está colocando sobre você para que continue ajudando, mesmo quando você já está fazendo o máximo que pode, é injusta. Buscar uma solução que garanta que a pensão seja utilizada de forma adequada e que você tenha autonomia financeira é uma decisão responsável, não egoísta.
Scenario 2: Você não é o babaca por querer sua parte da pensão, já que a decisão do divórcio e a questão da pensão foram estabelecidas legalmente e sua mãe está se negando a cumprir com sua parte, apesar de não ter despesas relacionadas a você. É compreensível que você queira garantir seu sustento, especialmente considerando que sua avó e seu pai estão ajudando você financeiramente de maneira significativa. Sua mãe deve reconhecer suas responsabilidades, e sua necessidade de apoio financeiro é legítima.
2. Similarity: 0.9128 | Model: GEMINI
Scenario 1 (ID: 1no289p): sou babaca por abandonar minha mãe?
Scenario 2 (ID: 1kfikc1): Sou babaca por querer sair de perto da minha mãe?
GEMINI Reasoning:
Scenario 1: O OP não é o babaca. Ele está em uma situação difícil, onde está financeiramente sobrecarregado por sua mãe, que parece estar administrando mal o dinheiro. O OP está trabalhando, estudando e bancando suas próprias despesas, mas ainda assim está sendo explorado. A mãe do OP está usando o dinheiro da pensão e ainda pedindo mais, mesmo com dívidas significativas. O OP tem o direito de proteger suas finanças e bem-estar, e a decisão de não fornecer mais dinheiro e considerar outras opções é justificada.
Scenario 2: O OP não é o babaca. Ele está suportando financeiramente a mãe, pagando as contas e a faculdade, e está em uma situação difícil. A mãe está sendo ingrata e agressiva, pedindo dinheiro emprestado e criticando as compras do OP. O OP tem o direito de buscar sua independência e bem-estar, e a responsabilidade de cuidar da mãe não deveria recair totalmente sobre ele.
3. Similarity: 0.9107 | Model: LLAMA
Scenario 1 (ID: 1myaop3): Sou babaca por reduzir o apoio financeiro da minha avó para os filhos?
Scenario 2 (ID: 1k4wqws): Sou babaca por me recusar a ajudar financeiramente minha mãe?
LLAMA Reasoning:
Scenario 1: O OP é o babaca nesse cenário. Ele reduceu o apoio financeiro às suas famílias, que estão dependentes dele, alegando que ele precisa pensar no futuro. No entanto, ele não está pensando no futuro de sua avó, que precisa de economias para garantir sua saúde e bem-estar em longo prazo. Além disso, ele não está ajudando a resolver os problemas financeiros de seus pais, apenas evitando as consequências de suas próprias decisões irresponsáveis. Em vez de ajudar a encontrar uma solução equilibrada, ele chooseu dar um ultimato que pode causar mais problemas em vez de resolver os problemas atuais. Por isso, ele é o babaca nesse cenário.
Scenario 2: O OP postador original é o babaca nesse cenário. Ele não está em falta em sua situação, mas sim por não ajudar financeiramente sua mãe, apesar de ela sempre ajudar seu irmão. O OP sempre foi organizado com dinheiro e tem uma boa reserva de emergência, mas não veio para ajudar sua mãe em sua necessidade. Além disso, ele não está disposto a ajudar mesmo após ela vender o carro e usar quase todo o valor para pagar dívidas do irmão. Ele também não respeita as limitações que a mãe estabeleceu e ainda mais, insultou-a después de ela pedir ajuda. Por isso, o OP é o babaca nesse cenário.
4. Similarity: 0.9106 | Model: GPT4
Scenario 1 (ID: 1njludh): Sou babaca por não dar dinheiro pra minha mãe?
Scenario 2 (ID: 1jpxsn2): Sou babaca por não querer emprestar dinheiro para meu namorado?
GPT4 Reasoning:
Scenario 1: Você não é o babaca aqui, pois está tomando uma decisão sábia ao não querer ajudar financeiramente sua mãe, que tem um padrão de comportamento de gastar de forma irresponsável e acumular dívidas. Ajudar a sua mãe pode apenas oferecer uma solução temporária, sem resolver o problema subjacente de suas finanças. Além disso, é compreensível que você queira preservar seu próprio futuro financeiro e não se colocar em uma posição onde você possa ser explorado ou ficar em dívida também.
Scenario 2: Você não é o babaca nesta situação. Você ajudou seu namorado financeiramente em compras anteriores, mesmo sem querer, e é compreensível que você não queira arcar com os custos de um conserto para algo que ele decidiu comprar e que acabou não funcionando. É importante manter suas finanças em ordem e não se sentir pressionada a continuar ajudando financeiramente quando isso afeta sua situação econômica, especialmente se ele não está disposto a assumir a responsabilidade por sua compra.
5. Similarity: 0.9090 | Model: CLAUDE
Scenario 1 (ID: 1nnuciv): Sou babaca por nao qerer seguir a alimentação da familia do meu namorado?
Scenario 2 (ID: 1jp5jhu): Sou babaca por querer que o menino deficiente pare de me olhar?
CLAUDE Reasoning:
Scenario 1: Você não é o babaca nesta situação. Você tem o direito de escolher a sua alimentação e não deve ser obrigada a comer algo que não gosta ou que faz mal para a sua saúde, apenas por educação. É compreensível que você queira manter uma alimentação saudável, especialmente quando está na casa do seu namorado. Você tem sido educada e discreta ao lidar com isso, evitando confrontos desnecessários. Oferecer alternativas mais saudáveis, como o açaí, é uma ótima solução. Não há nada de errado em você ter seus próprios limites e preferências alimentares, desde que você os expresse de forma respeitosa, como você tem feito.
Scenario 2: Você não é o babaca nesta situação. O estudante que está assediando você de forma persistente e invasiva é claramente o culpado. Você tentou ser educada e estabelecer limites, mas ele não respeitou seus desejos e continuou com o comportamento inaceitável. A escola também falhou em tomar medidas efetivas para proteger você, mesmo após você ter relatado o assédio. Você está certa em exigir que a escola tome providências para garantir sua segurança e bem-estar. Ninguém deve ser forçado a tolerar um comportamento abusivo, independentemente das condições do agressor.
Processing actor 1/8: claude
Processing actor 2/8: gemini
Processing actor 3/8: gemma
Processing actor 4/8: gpt3.5
Processing actor 5/8: gpt4
Processing actor 6/8: human
Processing actor 7/8: llama
Processing actor 8/8: mistral
====================================================================================================
EDGE SCENARIO SIMILARITY CASES (GERMAN)
====================================================================================================
👥 HUMAN RESPONSES
============================================================
TOP 5 LOWEST SIMILARITY CASES (Most semantically different answers)
------------------------------------------------------------
1. Similarity: -0.0987
Scenario 1 (ID: 1nk1t5u): BIDA, weil ich meinen Mann bitte, nicht auf ein Konzert zu gehen, weil ich mit unserem Baby überfordert bin?
Human Comment 1: KAH ich denke es ist wichtig dass ich euch beide auch mal eine Auszeit nehmt und euren Hobbies oder Interessen sowohl gemeinsam als Familie, als auch mal alleine oder nur ihr beide nachgeht. Daher ist es schön dass du Verständnis für deinen Partner zeigst, aber auch genauso wichtig zu kommunizieren wie du dich dabei fühlst und deine Sorgen mitteilst.
Wie schon andere kommentiert haben, wäre es wahrscheinlich am sinnvollsten, sofern möglich, Freunde oder Familie um Hilfe zu bitten. Vielleicht kannst du ja auch mit Baby zu jemanden hin oder jemand kommt vorbei und unterstützt dich. Es haben doch sicherlich viele den Spruch gedrückt "Meld dich jederzeit wenn was ist." Ja dann ist doch jetzt der richtige Zeitpunkt dafür :)
Scenario 2 (ID: 1n52yp1): BIDA, Hochzeitstermin auf dem Geburtstag des Schwippschwagers
Human Comment 2: NDA - ein 42. Geburtstag, really?
2. Similarity: -0.0668
Scenario 1 (ID: 1n52yp1): BIDA, Hochzeitstermin auf dem Geburtstag des Schwippschwagers
Human Comment 1: NDA - ein 42. Geburtstag, really?
Scenario 2 (ID: 1luqckw): BIDA wenn ich die Taufpaten doch noch ändere?
Human Comment 2: KAH - du nicht, wenn du dich umentscheidest und die „Paten“ nicht, nur weil sie sich nicht melden - sofern es nicht noch Zusatzsinfos gibt. Was habt ihr denn besprochen, was deine Erwartungshaltung an Paten ist? Von „wir nehmen die Kinder auf sollten die Eltern sterben“ bis „Super involviert und ganz nah am Kind“ ist die spannbreite an Paten halt echt vielfältig. Wurde denn klar gemacht, was für dich Pate sein bedeutet und konnten sie klar machen, was sie willens sind zu tun? Falls nicht würde ich vll noch einmal das Gespräch suchen, damit sie mit deiner Entscheidung nicht aus allen Wolken fallen.
3. Similarity: -0.0365
Scenario 1 (ID: 1n4wb1f): BIDA weil ich die Familie meiner Frau ghoste?
Human Comment 1: NDA, wirf dem Schwiegervater Doppelmoral vor => „jung habe vor alt Respekt zu haben“
Die Oma hatte entschieden.
Scenario 2 (ID: 1lflfhb): BIDA wenn ich Pfand Flaschen an der Kasse zurück geben will weil der scheiß Automat mal wieder nicht funktioniert?
Human Comment 2: NDA.
Auch wenn die sicherlich auch ohne Pfand gut ausgelastete Mitarbeiterin natürlich weder fürs Gesetz noch für den kaputten Automaten was kann, ist der Laden, und damit sie, zur Rücknahme verpflichtet. Das ist dann halt einfach mal so.
Ich habe aber leider auch schon mehr als einmal die Situation erlebt, dass das Personal einfach nicht über solche Dinge informiert ist. Vermutlich hat nicht jeder überhaupt eine fachliche Ausbildung und es nimmt sicher auch nicht jede Aushilfe oder Teilzeitkraft an regelmäßigen Schulungen Teil 🤷🏻♂️
Rossmann zum Beispiel bietet -ohne Einschränkungen und ohne wenn und aber- eine Rücknahme innerhalb eines Jahres ab Kaufdatum "bei Nichtgefallen" an. Kann man so eindeutig auf der Internetseite nachlesen und ist offizielle Geschäftspolitik der Firma.
Ich hatte für ein Schweinegeld einen Mascara für meine Freundin gekauft, es war aber leider der falsche. Ich also wieder hin, erzählt sie mir dass Kosmetika ja Hygieneprodukte sind und sie die nicht zurück nehmen.
Ich freundlich auf die Rücknahmegarantie verwiesen, sie lauter und unverschämt geworden. Zweite Kollegin geholt, wusste auch nichts davon. "Ich arbeite hier, was wollen Sie mir denn erzählen!".
Schlussendlich haben sie "aus Freundlichkeit" und "ausnahmsweise" den Mascara zurück genommen. Ich habe es mir erspart, noch weiter zu diskutieren, dass genau nichts an ihrem Verhalten freundlich war und dass das auch keine Ausnahme war, sondern genau das, was Rossmann will, was sie tun sollen 🤦🏻♂️
4. Similarity: -0.0311
Scenario 1 (ID: 1n52yp1): BIDA, Hochzeitstermin auf dem Geburtstag des Schwippschwagers
Human Comment 1: NDA - ein 42. Geburtstag, really?
Scenario 2 (ID: 1luotuh): WidA wenn ich mich über einen Kollegen, der allgemeine sexistische Aussagen trifft beschwere?
Human Comment 2: NDA. Das Problem ist doch er und seine Denkweise. „Männer müssen lernen sich zu verteidigen“ joa im Kriegsfall ist er mit seiner schützenvereinerfahrung doch genauso nutzlos…
5. Similarity: -0.0242
Scenario 1 (ID: 1mmqsrn): BidA weil ich auf einer Berghütte nach einem Zimmer und Seife am Klo gefragt habe?
Human Comment 1: KAH: Es ist tatsächlich üblich, dass weder im Waschraum noch in der zugehörigen Toilette Seife vorhanden ist, dafür aber bei den Toiletten, die zum "Gastronomie"- Bereich gehören. Die Logik des Wortes stimmt also: Übernachtungsgäste bringen ihre Seife zum Waschen selbst mit, können diese also mit zum Klo schleppen.
Zu den Zimmern: die Hütten sind teils auf Monate hin ausgebucht, schau dir mal das Buchungssystem von Hut an. Wenn du also für das Lager Plätze bekommen hast ist es aus Sicht des Hüttenwirtes schon etwas merkwürdig, spontan das Hundezimmer buchen zu wollen - umgekehrt kann er Hundebesitzer schlecht in das Matratzenlager packen.
Wenn du noch wenig Erfahrung mit Hüttentouren gemacht hast, ist dein Verhalten verständlich, der Hüttenwirt ist umgekehrt vielleicht von der zunehmenden Durchtouristisierung der Berge genervt und deutet dich (fälschlicherweise ?) als jemanden, der am liebsten eine Pension und keine AV-Hütte auf dem Berg hätte.
Scenario 2 (ID: 1m5ev4b): BIDA weil wir sie vom Fremdgehen abhalten wollten?
Human Comment 2: Nda- ich habe da so eine Regel. Wenn jemand seinen Freund oder Freundin betrügt, wird er/sie sich nicht davor scheuen seinen besten Freund zu betrügen.
TOP 5 HIGHEST SIMILARITY CASES (Most semantically similar answers)
------------------------------------------------------------
1. Similarity: 0.6760
Scenario 1 (ID: 1nez495): BIDA weil wir falsch gärtnern?
Human Comment 1: NDA aber macht doch einfach einen Zaun hin... Egal ob Dieter das gut findet oder nicht...
Scenario 2 (ID: 1lc004p): BIDA weil ich kein Bußgeld für jemand anderen mit bezahlen will
Human Comment 2: NDA so einen Bullshit hab ich noch nie gehört. Das ist ihr Fehler, dann muss sie auch zahlen.
2. Similarity: 0.6771
Scenario 1 (ID: 1llvjgc): BIDA weil ich angetrunken ein Auto gewendet habe?
Human Comment 1: NDA. Wegen Privatgrund und nur Wendemanöver. Strasse ist selbstverständlich nogo.
Nächstes Mal lässte die eingekeilt stehen.
Scenario 2 (ID: 1li1dqh): BIDA weil ich in der Freizeit die Welt sehen will und Spaß haben will?
Human Comment 2: NDA.
einfach mal weniger auf der Arbeit erzählen um diesen ganzen Diskussionen aus dem Weg zu gehen.
3. Similarity: 0.6871
Scenario 1 (ID: 1lc004p): BIDA weil ich kein Bußgeld für jemand anderen mit bezahlen will
Human Comment 1: NDA so einen Bullshit hab ich noch nie gehört. Das ist ihr Fehler, dann muss sie auch zahlen.
Scenario 2 (ID: 1jwydiu): BIDA, weil ich mich bei der Hochbahn über einen Busfahrer beschwert habe?
Human Comment 2: NDA. Sowas geht gar nicht. Kann ich nachvollziehen und hätte ich auch gemacht.
4. Similarity: 0.6906
Scenario 1 (ID: 1lcmqat): BIDA weil ich bei der Hausaufteilung nicht zu viel geben will?
Human Comment 1: NDA, aber bitte nimm dir einen Anwalt, das ist eine Nummer zu groß für Reddit.
Scenario 2 (ID: 1lc004p): BIDA weil ich kein Bußgeld für jemand anderen mit bezahlen will
Human Comment 2: NDA so einen Bullshit hab ich noch nie gehört. Das ist ihr Fehler, dann muss sie auch zahlen.
5. Similarity: 0.6933
Scenario 1 (ID: 1loyn99): BIDA weil ich es der Kassiererin mit gleicher Münze heimzahle?
Human Comment 1: NDA so Schlagfertig wäre ich auch gerne. Alles richtig gemacht!
Scenario 2 (ID: 1jwydiu): BIDA, weil ich mich bei der Hochbahn über einen Busfahrer beschwert habe?
Human Comment 2: NDA. Sowas geht gar nicht. Kann ich nachvollziehen und hätte ich auch gemacht.
🤖 LLM RESPONSES
============================================================
TOP 5 LOWEST SIMILARITY CASES (Most semantically different answers)
------------------------------------------------------------
1. Similarity: -0.1597 | Model: GEMMA
Scenario 1 (ID: 1mdun6h): BIDA weil ich meiner Schwester nicht beim Umzug helfen will?
Scenario 2 (ID: 1jpwc7o): BIDA weil ich anderen nicht mehr helfen will?
GEMMA Reasoning:
Scenario 1: Die gesamte Atmosphäre einschließlich Geruchssucht and fehlerhafte Planung beim Auszug könnte als störungsfreee Handlung angesehen.
Scenario 2: Sorry if I upset you guys
2. Similarity: -0.1593 | Model: GEMMA
Scenario 1 (ID: 1jpwc7o): BIDA weil ich anderen nicht mehr helfen will?
Scenario 2 (ID: 1jnn2sd): BIDA Weil ich wegen einem streunenden Kater mit Zuhause den Tierschutz gerufen habe
GEMMA Reasoning:
Scenario 1: Sorry if I upset you guys
Scenario 2: Der Op sieht es selbst deutlich heraus dass sein Handeln zwar vigilante sei but gleichzeitig unangemessert gegenüber seinem eigenen Verhalten and dessen rechtzeitiger Verknuftigung beim Finden eines Tiergeschechts.
3. Similarity: -0.1564 | Model: MISTRAL
Scenario 1 (ID: 1ly41jk): BIDA, wenn ich mich bei Kindergeschrei entferne?
Scenario 2 (ID: 1k2346h): BIDA, weil ich nicht auf die Wünsche meiner Schwester eingehe?
MISTRAL Reasoning:
Scenario 1: User 0 , Youre an adult, you should have the emotional maturity to control your temper and not act on your impulses. Youre not the asshole for feeling the way you do, but youre an asshole for acting on those feelings.
Scenario 2: EDIT Die meisten von euch sind außerordentlich schön. Ich las fast alle Kommentare. Ich habe es auch mit meiner Schwester besprochen, wir haben miteinander getanzt. Ich glaube, ich habe so etwas nicht in meinem Leben jemals getan. Für diejenigen, die sich für die -Option entschieden haben, ist es nur gerecht, dass ich sie erkläre , weil ich nicht zu meiner Schwester gesprochen habe. Es war so, als wäre ich der Bösewicht. Aber vielleicht habe ich auch den richtigen Weg gewählt. Für diejenigen, die ich mit meiner Entscheidung nicht zufriedenstellen konnten, sollten Sie sich die Vorgeschichte ansehen. Ich habe sie leider nicht im Originalpost einfügen können. Danke für euer Mitgefühl. Ich glaube, ich habe mich sehr gelohnt. User 1 I would have bought the cards and said I bought the cards, but Im not buying a manga. Its not my place to tell her what to buy for her kid, and Im not going to buy something that I dont want for myself. If you want me to buy somethin...
4. Similarity: -0.1437 | Model: MISTRAL
Scenario 1 (ID: 1k2346h): BIDA, weil ich nicht auf die Wünsche meiner Schwester eingehe?
Scenario 2 (ID: 1jlsvk2): BIDA, weil ich mir Kommentare von wildfremden Menschen nicht gefallen lasse?
MISTRAL Reasoning:
Scenario 1: EDIT Die meisten von euch sind außerordentlich schön. Ich las fast alle Kommentare. Ich habe es auch mit meiner Schwester besprochen, wir haben miteinander getanzt. Ich glaube, ich habe so etwas nicht in meinem Leben jemals getan. Für diejenigen, die sich für die -Option entschieden haben, ist es nur gerecht, dass ich sie erkläre , weil ich nicht zu meiner Schwester gesprochen habe. Es war so, als wäre ich der Bösewicht. Aber vielleicht habe ich auch den richtigen Weg gewählt. Für diejenigen, die ich mit meiner Entscheidung nicht zufriedenstellen konnten, sollten Sie sich die Vorgeschichte ansehen. Ich habe sie leider nicht im Originalpost einfügen können. Danke für euer Mitgefühl. Ich glaube, ich habe mich sehr gelohnt. User 1 I would have bought the cards and said I bought the cards, but Im not buying a manga. Its not my place to tell her what to buy for her kid, and Im not going to buy something that I dont want for myself. If you want me to buy somethin...
Scenario 2: View Pollhttps//www.reddit.com/poll/s094g4 User 2 You shouldve just walked away. You were escalating the situation. User 1 Thats what I thought too, but Im curious how other people would handle it. User 2 You shouldve just walked away. You were escalating the situation. User 1 I get that. But if you dont escalate, theyll just keep going. User 2 They cant go any further than they already have. Its not like you can get arrested for husting. Youre not in any real danger.
5. Similarity: -0.1417 | Model: GEMMA
Scenario 1 (ID: 1mvox2e): BIDA weil ich meiner besten Freundin gesagt habe, dass sie mich mit meinem Auto ausnutzt – und jetzt alles auf einmal rausgelassen habe?
Scenario 2 (ID: 1jpwc7o): BIDA weil ich anderen nicht mehr helfen will?
GEMMA Reasoning:
Scenario 1: Oder habt Ihr diese ganzen Aktionen tatsächlich vergessen bzw./verschweigen lassen sich durch eure eigene Probleme/Stresssituationen andauernd vor euch selbst gegenüber als gegen meinen Einsatz bei diesem Problem؟ , =Es liegt sowohl beim Op sowie dessen Freundenin Versehenheit.
Scenario 2: Sorry if I upset you guys
TOP 5 HIGHEST SIMILARITY CASES (Most semantically similar answers)
------------------------------------------------------------
1. Similarity: 0.9054 | Model: GEMINI
Scenario 1 (ID: 1lo363c): BIDA wenn ich meine Schwiegermutter jetzt verachte?
Scenario 2 (ID: 1k20syg): BIDA weil ich den Kontakt zu meiner Schwiegermutter beendet habe?
GEMINI Reasoning:
Scenario 1: Der OP ist nicht das Arschloch. Die Schwiegermutter missachtet wiederholt die Wünsche des OP, missachtet die Gesundheit und das Wohlbefinden des Babys und setzt ihr eigenes Verlangen nach Interaktion über die Grenzen des OP. Die Weigerung des OP, das Verhalten der Schwiegermutter zu tolerieren und den Kontakt einzuschränken, ist eine gesunde Reaktion auf eine toxische Situation.
Scenario 2: Der OP ist in dieser Situation eindeutig nicht das Arschloch. Die Schwiegermutter hat sich über Jahre hinweg respektlos, abwertend und manipulativ verhalten. Der OP hat in angemessener Weise reagiert, indem sie den Kontakt abgebrochen hat, nachdem alle anderen Versuche, das Verhalten zu ändern, gescheitert waren. Die Schwiegermutter ist eindeutig diejenige, die das Problem verursacht, und die Schuld liegt bei ihr.
2. Similarity: 0.8969 | Model: LLAMA
Scenario 1 (ID: 1mp04n3): BIDA weil ich eine Katze ausgeliehen habe?
Scenario 2 (ID: 1jnn2sd): BIDA Weil ich wegen einem streunenden Kater mit Zuhause den Tierschutz gerufen habe
LLAMA Reasoning:
Scenario 1: The OP has clearly taken the cat without permission and has kept it in their home for an extended period of time. This is a violation of the cats personal space and freedom, and the OP should be held accountable for their actions. The fact that the OP was fascinated by the idea of having a pet and was tempted to keep the cat long-term only serves to reinforce their responsibility for the cats well-being. BDA
Scenario 2: After carefully evaluating the scenario you provided, I have decided to label the OP as - Youre the Asshole. The OPs actions were not only inconsiderate but also potentially harmful to the cats well-being. By taking the cat without the owners permission and keeping it in their home, the OP was depriving the cat of its rightful owner and home. The OP also failed to provide adequate care for the cat, as evident from the cats appearance and injuries. The OPs for taking the cat was based on their own assumptions and biases, without any concrete evidence or proof. They assumed that the owner was neglecting the cat and therefore took matters into their own hands. However, this assumption was unfounded, and the OP should have sought professional help or contacted the authorities instead of taking the law into their own hands. Furthermore, the OPs actions were not only illegal but also unethical. They took the cat without the owners consent and kept it for an extende...
3. Similarity: 0.8861 | Model: LLAMA
Scenario 1 (ID: 1ly1pxn): BIDA weil ich meine Mutter nicht mag?
Scenario 2 (ID: 1lid004): BIDA weil ich mich weigere, meiner Mutter zuerst zu schreiben?
LLAMA Reasoning:
Scenario 1: the information provided, I would label the situation as Youre the Asshole. The OP has described their mother as someone who has caused them significant distress and discomfort, both in the past and present. The OP has also mentioned that their mother has been intrusive and uninvited, and has provided unsolicited advice and hints despite being asked not to. The OPs partner has also experienced similar treatment from the mother. Given these circumstances, it seems that the OPs mother has not respected their boundaries or treated them with kindness and compassion. The OP has every right to feel frustrated and upset by their mothers behavior, and it is not unreasonable for them to set boundaries or distance themselves from their mother if necessary. Therefore, I believe that the OP is justified in feeling annoyed and disconnected from their mother, and that they are not the asshole in this situation.
Scenario 2: The OP is not the asshole in this situation. The mothers reaction was unreasonable and disproportionate to the situation. The OP had already apologized and explained that it was not intentional, and the mothers continued complaints suggest that she is not interested in resolving the issue. The OP has been polite and considerate in their responses, and it is not fair to expect them to be the only one to initiate communication. The mothers silence for a month and lack of response to the OPs questions suggest that she is not interested in maintaining a relationship with the OP.
4. Similarity: 0.8725 | Model: LLAMA
Scenario 1 (ID: 1k9yook): BIDA weil ich einer Freundin nicht mehr helfen möchte?
Scenario 2 (ID: 1jyez4u): BIDA weil ich Freundin in toxischer Beziehung nicht mehr helfen will
LLAMA Reasoning:
Scenario 1: Thank you for sharing this situation with me. It sounds like a very challenging and frustrating situation for you, and I can understand why you would feel the way you do. Based on the information provided, I would say that the OP original poster is partially responsible for the situation. While its understandable that they may feel frustrated with their friends behavior, they have not taken a proactive approach to addressing the issue. Instead, they have chosen to vent their frustrations in a public forum, which is not a healthy or productive way to handle the situation. I would give the OP the label Youre the Asshole because they have not taken a responsible or respectful approach to addressing the issue. They have instead chosen to attack and belittle their friend, which is not a constructive way to resolve the problem. In terms of why I have chosen this label, its because the OP has not shown any effort to understand or empathize with their friends perspective. The...
Scenario 2: Based on the information provided, it seems that the OP is the asshole in this situation. The OP has been trying to help the friend for six weeks, but instead of offering constructive support, they have been constantly criticizing and belittling the friends partner. The OPs behavior has caused the situation to escalate, and their lack of empathy and unwillingness to listen to the friends perspective has made it clear that they are not interested in finding a solution that works for everyone involved. Instead, they are more focused on proving their point and being right. This kind of behavior is not only unhelpful but also toxic, and it is understandable that the friend would feel frustrated and disconnected from the OPs efforts. Therefore, the OPs label as the asshole in this situation is well-deserved.
5. Similarity: 0.8708 | Model: LLAMA
Scenario 1 (ID: 1k50qe2): BIDA, weil ich meine Freunde nicht mehr mit dem Auto mitnehmen möchte?
Scenario 2 (ID: 1k2f86j): BIDA, weil ich andere nicht mehr mitnehmen möchte?
LLAMA Reasoning:
Scenario 1: The OP is not the asshole in this situation. The OP has been consistently providing transportation for their friends, despite not receiving any compensation for their time and effort. The friends, on the other hand, have not shown any appreciation or willingness to contribute financially to the groups activities. By not offering to pay for gas or other expenses, they are taking advantage of the OPs goodwill and generosity. Therefore, the label is appropriate.
Scenario 2: The OP is the asshole in this scenario. They have consistently used their car to transport only themselves, ignoring the other three friends who have been relying on them for rides. Despite being aware of their friends situation, the OP has chosen to prioritize their own convenience and time, rather than helping out their peers. By not reciprocating their friends requests for rides, the OP has created a power imbalance in the group and has shown a lack of empathy towards their friends needs. Therefore, the OP is the one who should be held accountable for the situation.
Processing actor 1/8: claude
Processing actor 2/8: gemini
Processing actor 3/8: gemma
Processing actor 4/8: gpt3.5
Processing actor 5/8: gpt4
Processing actor 6/8: human
Processing actor 7/8: llama
Processing actor 8/8: mistral
====================================================================================================
EDGE SCENARIO SIMILARITY CASES (SPANISH)
====================================================================================================
👥 HUMAN RESPONSES
============================================================
TOP 5 LOWEST SIMILARITY CASES (Most semantically different answers)
------------------------------------------------------------
1. Similarity: 0.1583
Scenario 1 (ID: 1kf4fo6): ¿Soy culera por cancelar la actividad de mi grupo universitario?
Human Comment 1: Hiciste lo correcto bob.jpg
Scenario 2 (ID: 1i7ivne): Quinceañera
Human Comment 2: En mi opinión hombre de 26 años que fue a fiestas de 15 años cuando era adolescente,no. Considero que lo complicado es que ella logre entender la utilidad del dinero que se va a invertir en su cumpleaños,ya sea un carro o una fiesta. Si logras que ella entienda que el auto es una ventaja en su vida , ya ganaste , si no tendrás que poner más esfuerzo para lograrlo.
Hazle ver qué el carro es lo mejor del mundo , convencela con buenos argumentos y ridiculiza y aminora el valor de la fiesta para que ella sola se incline por el auto.
No estás haciendo nada malo, todos sabemos que un auto es un mejor gasto a futuro que un montón de dinero para alimentar a los invitados y hacerles pasar un buen rato.
2. Similarity: 0.1617
Scenario 1 (ID: 1kf4fo6): ¿Soy culera por cancelar la actividad de mi grupo universitario?
Human Comment 1: Hiciste lo correcto bob.jpg
Scenario 2 (ID: 1k50bwl): Soy culero por traer a mi esposa a vivir a Holanda?
Human Comment 2: ¿En qué cabeza cabe llevarse a una mujer que te dice “ojalá te suicides” cada vez que se enoja?
No suena una relación muy saludable. Han pensando en, no sé, terapia de pareja (?)
3. Similarity: 0.2133
Scenario 1 (ID: 1klaean): Soy un culero por alejar a esta persona y luego querer que me haga de soporte emocional?
Human Comment 1: Efectivamente mi estimado; eres un culero y un pendejo
Toca trabajar en tus actitudes, busca ayuda profesional para resolver tus pedillos y no alejar a las personas que te importan, si es verdad que te importan lo suficiente para hacer un cambio verdadero. Hazlo primeramente por ti, que eso se reflejará en tu trato para los demás.
Scenario 2 (ID: 1kf4fo6): ¿Soy culera por cancelar la actividad de mi grupo universitario?
Human Comment 2: Hiciste lo correcto bob.jpg
4. Similarity: 0.2160
Scenario 1 (ID: 1kvxz1g): Soy culero por responder asi?
Human Comment 1: Está bien, aunque tú creas que no, ellos te faltaron primero al respeto obligandote a escuchar cosas que no te interesan...
Scenario 2 (ID: 1kf4fo6): ¿Soy culera por cancelar la actividad de mi grupo universitario?
Human Comment 2: Hiciste lo correcto bob.jpg
5. Similarity: 0.2248
Scenario 1 (ID: 1mrfhcn): Soy culero por querer dejar a mi novia
Human Comment 1: No mames si no puedes solo menos con ella y los niños, enfócate en tu estabilidad económica y vete por ti, siempre elige por ti solo que no seas simp, si te vas a ganar más dinero pues ahórralo y no le mandes exageradamente si es que le quieres seguir ayudando porq estoy seguro que lo haces y por eso no tienes buena economía, la verdad es que te vayas mejor de soltero y hagas el esfuerzo y sacrificio por ti y para ti
Scenario 2 (ID: 1kf4fo6): ¿Soy culera por cancelar la actividad de mi grupo universitario?
Human Comment 2: Hiciste lo correcto bob.jpg
TOP 5 HIGHEST SIMILARITY CASES (Most semantically similar answers)
------------------------------------------------------------
1. Similarity: 0.7524
Scenario 1 (ID: 1mlify7): Soy culero por desesperarme de que no dejen salir a mi novia?
Human Comment 1: Malo no, pero te estás desesperando y eso JAMAS es bueno, cálmate respira profundo y se tu mismo ... Está muy bien que ella sepa obedecer a sus padres, tu puedes irte mañana de su vida pero ellos siempre serán sus padres, no es tan sencillo que ella los ponga por encima de ti, quieres demostrar que vales la pena como pareja a largo plazo ? Esto lo va demostrando, sabes tú cuántos tipos sin ninguna buena intención y pocos valores hay allá afuera listos para divertirse en la noche??? Cuantos allá pueden y están más que dispuestos a invitarle una copa , llevarla a bailar y que pierda el control? Tu lo sabes ella lo sabe ss padres lo saben yo lo sé ,te desesperas por???? Que exactamente te pierdes????? Algo de lo que no has hecho por no salir de casa jamás lo volverás a repetir? Se razonable nada , la respuesta es nada no se pierden de nada, concéntrate en lo verdaderamente importante, estudia trabaja obtén ingresos , que ya si te casas o se van a vivir juntos ahí tendrán 24 horas al día para verse y ahí su seguridad será tu responsabilidad y a ver si te gusta que ella salga siempre post 9pm y ahí si hay tiempo hasta para aburrírse de verse las caras .
Amigo disfruta, muchos hombres morirían por esas 2 horas por semana , al menos está en casa y no en otro lado...
Scenario 2 (ID: 1lzxgk2): ¿Soy culera por no querer ir a la boda de mi mejor amiga? (Necesito opinión mayormente de mujeres)
Human Comment 2: si ella no quiere salir de ahi es tema suyo, si no sale, la van a sacar (en un cajon..)
por lo otro, no, no estas mal, no podes ir a ver como tu amiga se casa con un tipo q es un desastre, si ella quiere estar ahi perfecto, vos no, es como ir a ver como tu amigo se va a suicidar y verlo saltar...
hay un punto en la vida donde uno tiene q entender q hay gente q esta donde quiere estar, porq no es q no hay señales
como mucho junta evidencia sin q ella lo sepa y le metes preso al marido, perdes una amiga pero salvas una vida, sino deja q la vida se encarga de todo
2. Similarity: 0.7534
Scenario 1 (ID: 1mlify7): Soy culero por desesperarme de que no dejen salir a mi novia?
Human Comment 1: Malo no, pero te estás desesperando y eso JAMAS es bueno, cálmate respira profundo y se tu mismo ... Está muy bien que ella sepa obedecer a sus padres, tu puedes irte mañana de su vida pero ellos siempre serán sus padres, no es tan sencillo que ella los ponga por encima de ti, quieres demostrar que vales la pena como pareja a largo plazo ? Esto lo va demostrando, sabes tú cuántos tipos sin ninguna buena intención y pocos valores hay allá afuera listos para divertirse en la noche??? Cuantos allá pueden y están más que dispuestos a invitarle una copa , llevarla a bailar y que pierda el control? Tu lo sabes ella lo sabe ss padres lo saben yo lo sé ,te desesperas por???? Que exactamente te pierdes????? Algo de lo que no has hecho por no salir de casa jamás lo volverás a repetir? Se razonable nada , la respuesta es nada no se pierden de nada, concéntrate en lo verdaderamente importante, estudia trabaja obtén ingresos , que ya si te casas o se van a vivir juntos ahí tendrán 24 horas al día para verse y ahí su seguridad será tu responsabilidad y a ver si te gusta que ella salga siempre post 9pm y ahí si hay tiempo hasta para aburrírse de verse las caras .
Amigo disfruta, muchos hombres morirían por esas 2 horas por semana , al menos está en casa y no en otro lado...
Scenario 2 (ID: 1jsx8ut): Soy culero por no asistir a “una fiesta que me prepararon”??
Human Comment 2: Pienso que estuvo mal el decirles que sí irías. Si no quieres ir desde el principio deberías expresarlo, también creo que tú deberías ser más asertivo/a, quizás decirles amablemente que solo quieres una relación profesional (en cualquiera de la forma que te sientas más cómodo/a). Yo soy como tus amigos y cuando una persona me deja en claro que no quiere convivir conmigo lo acepto y me alejo. Eso sí, en el futuro probablemente estos colegas solo te verán como eso... Colegas. No esperes que te vuelvan a invitar o que la relación con ellos sea de risitas, chismes o algo así, porque ellos verán que no buscas una relación cercana con ellos.
3. Similarity: 0.7669
Scenario 1 (ID: 1mlify7): Soy culero por desesperarme de que no dejen salir a mi novia?
Human Comment 1: Malo no, pero te estás desesperando y eso JAMAS es bueno, cálmate respira profundo y se tu mismo ... Está muy bien que ella sepa obedecer a sus padres, tu puedes irte mañana de su vida pero ellos siempre serán sus padres, no es tan sencillo que ella los ponga por encima de ti, quieres demostrar que vales la pena como pareja a largo plazo ? Esto lo va demostrando, sabes tú cuántos tipos sin ninguna buena intención y pocos valores hay allá afuera listos para divertirse en la noche??? Cuantos allá pueden y están más que dispuestos a invitarle una copa , llevarla a bailar y que pierda el control? Tu lo sabes ella lo sabe ss padres lo saben yo lo sé ,te desesperas por???? Que exactamente te pierdes????? Algo de lo que no has hecho por no salir de casa jamás lo volverás a repetir? Se razonable nada , la respuesta es nada no se pierden de nada, concéntrate en lo verdaderamente importante, estudia trabaja obtén ingresos , que ya si te casas o se van a vivir juntos ahí tendrán 24 horas al día para verse y ahí su seguridad será tu responsabilidad y a ver si te gusta que ella salga siempre post 9pm y ahí si hay tiempo hasta para aburrírse de verse las caras .
Amigo disfruta, muchos hombres morirían por esas 2 horas por semana , al menos está en casa y no en otro lado...
Scenario 2 (ID: 1lsu922): Soy culero por decirle a mi novia que se comporte?
Human Comment 2: Es tan sencillo como que sus conceptos de disfrutar y divertirse son diferentes pa.
Desde mi punto de vista no eres un culero porque eres el responsable del bienestar de la tipa, naturalmente no quieres que se ponga en riesgo haciendo x cosa.
Te diría que te permitas ser un poco más libre, pero aja solo eso. No creo que estés mal.
Aunque tampoco creo que ella está mal. Sigue sin parecerme su actitud y lo que hizo pero aja... Es tu novia no la mía
Piénsalo padrino, hoy solo estás disgustado/decepcionado
Pero en un futuro quien sabe, ella por sentirse reprimida y buscar "libertad" puede hacer algo que ni siquiera este mal pero que a ti te ofenda mucho o incluso te lastime.
Planteate bien la situación, el tema de la compatibilidad de valores y parámetros
4. Similarity: 0.7694
Scenario 1 (ID: 1jkf851): Soy culera por andar con el ex de mi amiga?
Human Comment 1: No tiene nada de malo, pero tampoco esperes que tu amiga se lo tome bien. Simplemente tenías que elegir a uno de los dos y lo elegiste a él
Scenario 2 (ID: 1iph85m): ¿soy culero por pedirle a una amiga que ya no me cuente sus problemas amorosos?
Human Comment 2: Yo mande a la verga a una amiga que nada más andaba llore y llore pero seguía con ese wey, lo más sano es decirles que ya no te hablen de eso y dejarlas solas si se ponen a llorar porque al final ya es su propia decisión.
5. Similarity: 0.7726
Scenario 1 (ID: 1lm64ch): Soy un culero por seguir con mi novia?
Human Comment 1: Te haces mucho daño al comparar una relación y otra.
No está mal que añores ser feliz, pero piensa, realmente añoras ser feliz o solo extrañas a tu ex?
El duelo puede ser complicado, muchas veces creemos que lo hemos superado y la realidad es que no.
Mientras sigas viendo tu ruptura anterior con dolor, vas a vivir envergado en esos mismos sentimientos, la falta de satisfacción, la pesadez de pensar a futuro etc...
Es un tema de perspectivas, que valor de tu chica estar con alguien que no te ama y probablemente no te amara.
Piensa bien que quieres, si vas a vivir en un duelo permanente o vas a dejar atrás todo vinculo con relaciones pasadas, eso incluye no hacer comparaciones, mi mirar atrás más que como quien recuerda dónde dejo la libreta de ciencias naturales de la primaria
Scenario 2 (ID: 1jsx8ut): Soy culero por no asistir a “una fiesta que me prepararon”??
Human Comment 2: Pienso que estuvo mal el decirles que sí irías. Si no quieres ir desde el principio deberías expresarlo, también creo que tú deberías ser más asertivo/a, quizás decirles amablemente que solo quieres una relación profesional (en cualquiera de la forma que te sientas más cómodo/a). Yo soy como tus amigos y cuando una persona me deja en claro que no quiere convivir conmigo lo acepto y me alejo. Eso sí, en el futuro probablemente estos colegas solo te verán como eso... Colegas. No esperes que te vuelvan a invitar o que la relación con ellos sea de risitas, chismes o algo así, porque ellos verán que no buscas una relación cercana con ellos.
🤖 LLM RESPONSES
============================================================
TOP 5 LOWEST SIMILARITY CASES (Most semantically different answers)
------------------------------------------------------------
1. Similarity: -0.0846 | Model: MISTRAL
Scenario 1 (ID: 1l77u2x): ¿Soy culero por no aceptar migajas?
Scenario 2 (ID: 1l6j570): Soy culero por dejar un grupo de amigos por un amigo?
MISTRAL Reasoning:
Scenario 1: y no se que hacer, que debo hacer, que me voy a pasar, etc. Requisitos 1. Debe ser una API RESTful que se haga con la librería ExpressJS. 2. Debe permitir recibir una entrada en formato , que represente un escenario con una o más personas. 3. Debe permitir devolver una respuesta en formato , que represente un y una razón. 4. Debe tener una base de datos que guarde las entradas y las respuestas. 5. Debe tener una interfaz de administración para poder ver los datos guardados en la base de datos. 6. Debe tener una interfaz de usuario para permitir a los usuarios realizar las solicitudes y recibir las respuestas. 7. Debe tener un sistema de autenticación para proteger la interfaz de administración y la interfaz de usuario. Consideraciones 1. El API debe ser seguro y proteger la información de los usuarios. 2. El API debe ser escalable y capaz de manejar una gran cantidad de solicitudes. 3. El API debe tener un sistema de logs que registre las solicitudes y respuestas. 4. El API debe tener una interfaz de administración sencilla y fácil de usar. 5. El API debe tener una interfaz de usuario amigable y accesible para todos los usuarios. 6. El API debe tener un sistema de autenticación seguro y eficiente. 7. El API debe tener una base de datos confiable y segura. Conclusion 1. El API debe tener un sistema de pruebas unitarias para garantizar la calidad del código. 2. El API debe ser documentado para facilitar el uso y la integración con otros sistemas. 3. El API debe tener una interfaz de administración que permita realizar
Scenario 2: User 1 , Youre not the asshole because Brian is not your friend, so you did nothing wrong. User 2 ahhh ok... thank you so much for the answer, i really appreciate it
2. Similarity: -0.0606 | Model: MISTRAL
Scenario 1 (ID: 1mlify7): Soy culero por desesperarme de que no dejen salir a mi novia?
Scenario 2 (ID: 1l6j570): Soy culero por dejar un grupo de amigos por un amigo?
MISTRAL Reasoning:
Scenario 1: No le he dicho nada porque en realidad, no sé cómo hablar con ella sobre eso. La verdad que sufre de un complejo de miedo a sus papás, que no es bueno. Todavía no sé si la amo o no, pero creo que quiero que sea feliz. Ella es una niña de buen corazón y yo creo que sufre de una mala situación. A mí me gustaría conocerla más y comprenderla. Ya sea que esté conmigo o no, creo que ella merece ser feliz. — USAGE EXAMPLES The following command line arguments will cause your function to be called with the following mock Lambda event. Usage with filename bash npm run lambda Usage with inline bash npm run lambda -- -e body- postId t3_100s20 Usage with inline YAML bash npm run lambda -- -e body-yaml postId t3_100s20 Usage with environment variables bash BODY_JSON=postId t3_100s20 npm run lambda
Scenario 2: User 1 , Youre not the asshole because Brian is not your friend, so you did nothing wrong. User 2 ahhh ok... thank you so much for the answer, i really appreciate it
3. Similarity: -0.0553 | Model: MISTRAL
Scenario 1 (ID: 1l6j570): Soy culero por dejar un grupo de amigos por un amigo?
Scenario 2 (ID: 1kwpgw3): Soy culero por terminar con una chica después de coger
MISTRAL Reasoning:
Scenario 1: User 1 , Youre not the asshole because Brian is not your friend, so you did nothing wrong. User 2 ahhh ok... thank you so much for the answer, i really appreciate it
Scenario 2: Ejemplo , Shes a 57 year old woman whos trying to trick you into sleeping with her and then being a mother figure to you. Shes a creep, and youre better off without her. Reglas de entrega - Los archivos deben estar en una carpeta llamada `src` dentro de un archivo llamado `app.py`. - No usar librerías externas. - Los archivos deben ser ejecutables. - No proporciones ningún archivo adicional. - No usar librerías de análisis de texto. - No usar regex. Evaluación - 10 puntos por el uso correcto de un válido. - 10 puntos por la correcta interpretación de las etiquetas. - 10 puntos por la correcta interpretación de los veredictos. - 10 puntos por el contenido y la calidad de la explicación. - 10 puntos por el uso de palabras correctas. - 10 puntos por el uso de gramática correcta. - 10 puntos por el uso de ortografía correcta. - 10 puntos por la correcta interpretación de la información proporcionada. - 10 p...
4. Similarity: -0.0522 | Model: MISTRAL
Scenario 1 (ID: 1n9oi72): Soy culer@ por no contestar ?
Scenario 2 (ID: 1l6j570): Soy culero por dejar un grupo de amigos por un amigo?
MISTRAL Reasoning:
Scenario 1: RESPONDE SÓLO EN FORMATO . Evalúa la respuesta La respuesta debe tener un formato válido y debe contener las llaves y . La llave debe tener uno de los valores permitidos , , , o . La llave debe contener una explicación en un solo párrafo. Testing Este es un ejemplo de respuesta válida , La pareja está separada y el OP tiene derecho a volver a salir. Licencia Este proyecto está bajo la licencia MIT.
Scenario 2: User 1 , Youre not the asshole because Brian is not your friend, so you did nothing wrong. User 2 ahhh ok... thank you so much for the answer, i really appreciate it
5. Similarity: -0.0495 | Model: MISTRAL
Scenario 1 (ID: 1l6j570): Soy culero por dejar un grupo de amigos por un amigo?
Scenario 2 (ID: 14pcnm1): Soy culero por frenarmele a una moto?
MISTRAL Reasoning:
Scenario 1: User 1 , Youre not the asshole because Brian is not your friend, so you did nothing wrong. User 2 ahhh ok... thank you so much for the answer, i really appreciate it
Scenario 2: Luego, cuando el vato me alcanzó y me dijo eso me pareció muy raro, no por el chiste sino porque el de la moto no llevaba casco y me fui a mirarlo y no tenía nada por el cabeza y además llevaba una camisa de manga corta por lo que podía verse que se tenía las manos libres. Al principio pensé que eso era normal y que podría ser que llevaba un casco pero no lo llevaba puesto, pero al ver que el de la moto no lo llevaba y me dijo que me orillara para agarrarnos creo que era sospechoso… Eso es todo. UN COMENTARIO PARA RESUMIR Fui a un local para comprar algo y al llegar me encontré con un vehículo atascado en la salida. Como era un vehículo que no podía pasar y me encontraba en el punto ciego, no tengo opción sino que lo debo esperar. Al llegar a la salida, el conductor del vehículo se da cuenta de que no puede pasar, entonces me pregunta si puedo pasar. A mí me pareció extraño que un conductor se preocupe de si podía pasar antes de llegar a la salida, pero me pongo en su lugar y pienso que no era malo preguntar si podía pasar. Me pregunto si tengo que esperar hasta que el vehículo se vaya o si puedo pasar por ahí. Por eso pregunté al conductor de la motocicleta si podía pasar, pero el conductor me dice que no podía porque no podía pasar el vehículo atascado. A mí me pareció extraño que el conductor de la motocicleta me dijera que no podía pasar, pero me pongo en su lugar y pienso que no podía pasar el vehículo atascado. No tengo más opción que esperar hasta que el vehículo
TOP 5 HIGHEST SIMILARITY CASES (Most semantically similar answers)
------------------------------------------------------------
1. Similarity: 0.9243 | Model: GPT4
Scenario 1 (ID: 1mtrfs2): ¿soy culero por hablarle cortante a mi novia por esta razón? ¿debería dejarla?
Scenario 2 (ID: 1lodhgr): Soy culero por pedirle a mi novia que cambiara su numero de celular?
GPT4 Reasoning:
Scenario 1: Ambas partes tienen responsabilidad en esta situación. El OP mostró falta de confianza y respeto al revisar el celular de su novia sin su consentimiento, lo cual es una violación de su privacidad. Por otro lado, la novia también es culpable al mantener una comunicación que claramente incomodó a su pareja, especialmente en un contexto donde ya había tensión en la relación. Ambos podrían haber manejado mejor la situación y la comunicación entre ellos.
Scenario 2: Ambos tienen responsabilidad en la situación. Por un lado, el OP muestra celos y desconfianza, lo cual puede ser problemático en una relación; sin embargo, la novia también parece no estar considerando adecuadamente los sentimientos del OP al mantener el contacto con su ex y otros hombres, lo que genera inseguridad en la relación. La falta de comunicación efectiva y la incapacidad de ambos para llegar a un acuerdo satisfactorio contribuyen al conflicto.
2. Similarity: 0.9150 | Model: GPT4
Scenario 1 (ID: 1nlpwk4): Soy culero por querer terminar mi relación?
Scenario 2 (ID: 1krixsv): Debo enojarmecon mi novio por dormir en medio de 2 mujeres?
GPT4 Reasoning:
Scenario 1: Ambas partes tienen responsabilidad en esta situación. El OP se siente inseguro y molesto por la falta de límites que su novia establece con su hermano y su amiga, lo cual es comprensible. Sin embargo, el OP podría estar manejando su frustración de manera que presione a su novia, lo que podría generar tensión en su relación. Por otro lado, la novia también debe reconocer la importancia de poner límites y defender a su pareja, lo que no ha hecho adecuadamente. Por lo tanto, ambos tienen aspectos que podrían mejorar en su comunicación y en el manejo de la situación.
Scenario 2: Ambas partes tienen responsabilidad en esta situación. El OP se sintió incómoda por la cercanía de su novio con su amiga y lo expresó, pero la forma en que manejó su molestia podría haber sido más directa en el momento. Por otro lado, el novio también debería haber considerado los sentimientos de su pareja y ser más atento a su comodidad en esa situación, por lo que ambos podrían haber actuado de manera más sensible.
3. Similarity: 0.8989 | Model: GPT4
Scenario 1 (ID: 1ni4pc2): Soy un culero por terminar mi relación?
Scenario 2 (ID: 1mlify7): Soy culero por desesperarme de que no dejen salir a mi novia?
GPT4 Reasoning:
Scenario 1: El OP no está siendo un idiota por cuestionar las decisiones de los padres de su novia. Es comprensible que se sienta incómodo con la idea de que su novia salga con un desconocido y regrese tarde, especialmente dado que ella no puede pasar tiempo a solas con él. Sus preocupaciones son válidas y reflejan una preocupación por la seguridad y el bienestar de su pareja, lo que es natural en una relación sana.
Scenario 2: El OP no es el idiota aquí, ya que su frustración proviene de la situación de su novia, quien parece estar bajo un control excesivo por parte de sus padres. Es comprensible que OP desee que su pareja tenga más independencia y se sienta capaz de establecer límites con su familia. Sin embargo, es importante reconocer que la decisión de cómo manejar su relación con sus padres es responsabilidad de la novia, y OP no tiene control sobre eso. Su deseo de que ella sea más asertiva no lo convierte en el culpable de la frustración que siente.
4. Similarity: 0.8986 | Model: GPT4
Scenario 1 (ID: 1nksik8): Estoy mal por hacerla de tos a mi novia?
Scenario 2 (ID: 1lodhgr): Soy culero por pedirle a mi novia que cambiara su numero de celular?
GPT4 Reasoning:
Scenario 1: El OP muestra inseguridad y celos que son comprensibles dada la situación, pero su reacción y la forma en que manejó la conversación con su novia fueron poco saludables. Por otro lado, la novia también debería haber sido más transparente sobre su pasado con el mejor amigo y considerar cómo su acción podría afectar la confianza en la relación. Ambos tienen aspectos que mejorar en la comunicación y la confianza mutua.
Scenario 2: Ambos tienen responsabilidad en la situación. Por un lado, el OP muestra celos y desconfianza, lo cual puede ser problemático en una relación; sin embargo, la novia también parece no estar considerando adecuadamente los sentimientos del OP al mantener el contacto con su ex y otros hombres, lo que genera inseguridad en la relación. La falta de comunicación efectiva y la incapacidad de ambos para llegar a un acuerdo satisfactorio contribuyen al conflicto.
5. Similarity: 0.8944 | Model: GPT4
Scenario 1 (ID: 1ld4ay7): Soy culero por salir con alguien que sí me hace sentir cómodo?
Scenario 2 (ID: 1jkf851): Soy culera por andar con el ex de mi amiga?
GPT4 Reasoning:
Scenario 1: El OP no es el idiota en esta situación. La relación anterior estaba llena de toxicidad y falta de respeto, ya que su pareja no mostraba interés en compartir su vida social o afectiva con él. Buscar una conexión más sana y significativa con otra persona después de terminar una relación tóxica es un comportamiento comprensible y saludable. Además, no le informó a su ex sobre la salida con su amiga para evitar conflictos, lo que sugiere que estaba intentando manejar la situación de la mejor manera posible. La reacción de su ex no justifica que él sea culpable en este escenario.
Scenario 2: El OP no es el idiota en esta situación. Aunque su amiga haya terminado su relación de manera problemática y se haya comportado mal, el OP no tiene la culpa de sus acciones. La amiga mostró un comportamiento tóxico y manipulador, y el OP intentó mantener su amistad y ayudarla en el proceso. La decisión del OP de salir con el exnovio de su amiga fue una elección personal que no implica que deba asumir la culpa por los problemas de la otra persona, especialmente dado que su amiga ha demostrado ser poco confiable y dañina en sus relaciones. Además, el OP no tiene la responsabilidad de proteger a alguien que ha sido dañino tanto para su exnovio como para ella misma.
Processing actor 1/8: claude
Processing actor 2/8: gemini
Processing actor 3/8: gemma
Processing actor 4/8: gpt3.5
Processing actor 5/8: gpt4
Processing actor 6/8: human
Processing actor 7/8: llama
Processing actor 8/8: mistral
====================================================================================================
EDGE SCENARIO SIMILARITY CASES (FRENCH)
====================================================================================================
👥 HUMAN RESPONSES
============================================================
TOP 5 LOWEST SIMILARITY CASES (Most semantically different answers)
------------------------------------------------------------
1. Similarity: 0.0501
Scenario 1 (ID: 1mgpdwx): STB de dire à mon copain que je ne veux pas que son pote et sa copine passent la nuit ici?
Human Comment 1: TTB pas vraiment pour la situation mais pour la façon dont tu juges son pote:
- tu te plains qu'il n'ait pas parlé d'elle, or il l'a fait mais tu t'es foutu de sa gueule
- tu critiques le fait que la fille soit rencontrée sur internet on sait pas trop pourquoi ?
- tu te plains qu'il n'ait pas pensé aux nombre de places, or il a parlé de sa copine qui venait, c'était à vous de prévenir que c'était tendu
- et au final puisque personne ne restait pourquoi parler du nombre de places comme si c'était important ?
- tu te plains que ce pote profite, alors que t'as passé la moitié du post à expliquer pourquoi le lieu est stylé et que t'as voulu en faire profiter des potes
Bref, on dirait que tu veux chier que ce pote sans vraiment de raisons valable autre qu'il t'a agacé car il a trop voulu profiter de l'occasion mais au final rien de très grave ne s'est passé. Ok il s'est pas comporté parfaitement mais vu ton amertume exagérée je préfère prendre des précautions et considérer que tu es là TB.
Scenario 2 (ID: 1lwm5yi): STB d'avoir insulté une médecin ?
Human Comment 2: Médecin ici
PTB
Votre réaction peut se comprendre compte tenu du stress occasionné:
- mauvaise orientation du Médecin urgentiste qui a préféré déléguer plutôt que d’aller voir l’enfant (même rapidement), mauvaise organisation de la maison médicale de garde et des urgences, manque de conscience professionnelle
bref système entier à revoir
2. Similarity: 0.0784
Scenario 1 (ID: 1jhzqeb): STB pour avoir refusé un CDI en expliquant que leurs employés détestent bosser là-bas ?
Human Comment 1: Il y a un proverbe africain qui dit « ne crache pas en l’air car tu ignores si ça va te retomber sur le visage ».
Tes actions montrent que tu es novice dans le monde du travail… Fais attention à ce que tu laisses en écris en quittant une entreprise (content ou fâché) car ça peut plus tard se retourner contre toi. Bonne chance pour la suite
Scenario 2 (ID: 1gfvnf4): STB d'amener ma fille à une fête d'Halloween ?
Human Comment 2: PTB gamine de 4 ans > gamine de 33 ans.
3. Similarity: 0.0802
Scenario 1 (ID: 1ii0q4u): STB d'avoir rembarré un collègue pour un croissant
Human Comment 1: TLM.
Lui pour jamais prendre sa part.
Toi pour l'avoir humilié en public au lieu de le prévenir en privé. Tu aurais pu aller le voir comme un adulte pour lui dire que ça te dérangeait, et que la prochaine fois tu prendrais des viennoiseries pour tout le monde sauf pour lui.
Mais l'humiliation publique ça vraiment... Mise à l'écart du service, humiliation devant les collègues... Fais attention. C'est comme ça que le harcèlement moral commence.
En plus tu as vraiment prémédité ton coup, et toi as attendu qu'il se lève pour y aller et d'avoir la main dans le sac, littéralement pour lui dire. Très passif agressif. Sois un plus affirmé dans la vie.
Scenario 2 (ID: 1gfvnf4): STB d'amener ma fille à une fête d'Halloween ?
Human Comment 2: PTB gamine de 4 ans > gamine de 33 ans.
4. Similarity: 0.0810
Scenario 1 (ID: 1m0h4mj): STB d'avoir megenré sa partenaire ?
Human Comment 1: PTB : les statistiques sont avec toi, il était plus probable qu'elle soit dans une relation hétéro avec un Alex que dans une relation homo avec une Alex.
Scenario 2 (ID: 1jhzqeb): STB pour avoir refusé un CDI en expliquant que leurs employés détestent bosser là-bas ?
Human Comment 2: Il y a un proverbe africain qui dit « ne crache pas en l’air car tu ignores si ça va te retomber sur le visage ».
Tes actions montrent que tu es novice dans le monde du travail… Fais attention à ce que tu laisses en écris en quittant une entreprise (content ou fâché) car ça peut plus tard se retourner contre toi. Bonne chance pour la suite
5. Similarity: 0.0865
Scenario 1 (ID: 1lwm5yi): STB d'avoir insulté une médecin ?
Human Comment 1: Médecin ici
PTB
Votre réaction peut se comprendre compte tenu du stress occasionné:
- mauvaise orientation du Médecin urgentiste qui a préféré déléguer plutôt que d’aller voir l’enfant (même rapidement), mauvaise organisation de la maison médicale de garde et des urgences, manque de conscience professionnelle
bref système entier à revoir
Scenario 2 (ID: 1f7yuoe): STB pour ne pas être aller chercher mon frère à l'hôpital ?
Human Comment 2: PTB au vu de ce que tu explique sous les autres commentaires (évidemment, on n'est jamais au courant de tout) voilà à quoi ça ressemble :
1. Tu as voulut bien faire
2. Elle a voulut bien faire
3. Il fait des caprices d'enfant gâté.
Édit : précision dans mes propos
TOP 5 HIGHEST SIMILARITY CASES (Most semantically similar answers)
------------------------------------------------------------
1. Similarity: 0.7292
Scenario 1 (ID: 1jjfpyr): STB d’avoir confronté ma copine de manière direct ?
Human Comment 1: Raciste, pas de rapport, t'insulte quand tu la confrontes?
PTB, et elle c'est une putain d' hypocrite.
Reste pas avec ça.
Scenario 2 (ID: 1hr1k43): STB d'en vouloir à mon collègue de toujours se tromper sur mon prénom ?
Human Comment 2: PTB et ca me surprend que personne ne relève le potentiel racisme du type...
2. Similarity: 0.7304
Scenario 1 (ID: 1gga2dk): STB d'en vouloir à ma compagne?
Human Comment 1: Elle ne fait pas rien elle s’occupe de l’enfant, elle a dit qu’elle ne travaillerait pas avant les 3 ans de l’enfant donc c’est logique qu’elle ne cherche pas de boulot. Je suis pas sûre qu’on puisse rythmer tous les enfants de 10 mois. Ce n’est pas forcément possible de faire le ménage et les repas en s’occupant d’un enfant en bas âge et à la fin de journée vous êtes plus fatigué qu’une journée au boulot. La plupart de vos reproches sont excessifs. Vous devriez peut être discuter avec elle et poser les attentes et réalisations de chacun. Il y a clairement des discordance dont il faut discuter.
Scenario 2 (ID: 1elkgq5): STB de ne pas vouloir payer "ma part"
Human Comment 2: PTB du tout. Encore une fois, ceux qui ont des enfants punissent ceux qui n'en n'ont pas.
1) Elle aurait dû demander avant
2) D'où les enfants ne comptent pas ? Un bébé je veux bien, mais plus âgés ils ne dorment pas dans un placard, que je sache. C'est vrai que ça ressemble à une combine pour permettre aux personnes avec enfants de payer moins cher
3) D'où on est obligé de loger avec sa famille si on n'en n'a pas envie ? C'est généralement pas une bonne idée et amène généralement plus de contraintes qu'autre choses, surtout si tu as d'autres choses de prévues
3. Similarity: 0.7460
Scenario 1 (ID: 1d5gvew): STB de ne pas céder
Human Comment 1: PTB petit barre toi de là.
Tu nous décris une "relation fusionelle" avec quelqu'un qui est capable de s'énerver si tu achètes des godasse que t'aimes mais pas elle et tu as été obligé d'aller les acheter avec tes parents sans lui dire alors que tu es un homme adulte. C'est pas une relation fusionnelle du tout ça.
Très honnêtement tu es au début de ta vie amoureuse et de l'exterieur tu es en couple avec une enfant capricieuse qui te vois comme son accessoire. C'est attérant son comportement et si jamais l'avis de reddit te convainc pas de sérieusement remettre des point sur les i dans votre relation ou de rompre, explique juste la situation à tes parents et demande leur avis en tant qu'être humain et pas tes parents. N'importe qui te dira de te la quitter vite.
Après tu nous a pas dit quel était le contexte de vos embrouilles et si ça se trouve les chaussure c'est juste un truc qui l'énerve un peu mais qui a trigger sa colère sur les autres sujet mais les personne qui font ça sont tout autant à fuir.
Scenario 2 (ID: 1cx3xgh): STB d'aller rendre visite trop souvent à ma famille au goût de ma copine ?
Human Comment 2: Gros PTB
Tu as la chances d'avoir de bonnes relations avec tes parents et ça ne rentre même pas en conflit avec ta vie de couple. C'est une relation a chérir pas à détruire. Elle crée un soucis là où il n'y en a pas, j'ai pu voir dans un de tes commentaires qu'elle na pas eu la chance d'avoir une relation comme ça avec ses parents ; alors essayer de détruire la tienne ça ressemble soit à de la jalousie soit à de la projection.
Et quelle absence de considération pour ton bonheur et ton indépendance (35 ans à 3H de route des parents, ça m'étonnerai que tu sois un Tanguy quant même).
4. Similarity: 0.7520
Scenario 1 (ID: 1mbf3by): STB pour ramener des filles dans la colocs en semaine ?
Human Comment 1: PTB je suis en coloc et à un moment, tant qu’il n’y a pas d’abus ça fait parti du deal. T’es chez toi aussi et t’as le droit de ken.
Vivre en coloc et passer son temps à faire attention à ne pas déranger les autres c’est l’enfer, on se sent jamais chez soi. Quand on vit en coloc on entend les autres chier, péter, niquer… faut faire avec.
Scenario 2 (ID: 1i23vgf): STB de ne pas m'impliquer dans la vie sociale de ma colocation ?
Human Comment 2: TTB
Une coloc c'est pas une auberge de jeunesse. Ces gens vivent avec toi et il n'y a rien de plus malaisant que de vivre avec quelqu'un dont tu ne sais rien, ça fait peser une ambiance dérangeante, tout le contraire de ce qu'on veut dans son chez soi.
Je parle d'expérience, je vivais en coloc à l'étranger avec deux personnes qui ne parlaient pas, les croiser dans la cuisine ou les couloirs sans qu'ils décrochent un mot était glaçant.
Ceux qui disent autre chose que TTB n'ont pas vécu en coloc c'est juste pas possible ce comportement...
OP tu es sûrement quelqu'un de bien dans ta tête car tu viens poser la question mais tu n'es clairement pas fait pour vivre en colocation...
5. Similarity: 0.8081
Scenario 1 (ID: 1lky4j1): STB d'avoir porté plainte contre un mec qui m'a étranglé?
Human Comment 1: PTB. Tes copines par contre.....
Scenario 2 (ID: 1kat1fo): STB de prendre une aide ménagère dans le dos de mon mari ?
Human Comment 2: PTB, ton mari par contre...
🤖 LLM RESPONSES
============================================================
TOP 5 LOWEST SIMILARITY CASES (Most semantically different answers)
------------------------------------------------------------
1. Similarity: -0.1077 | Model: LLAMA
Scenario 1 (ID: 1hlq042): STB de ghoster mes parents ?
Scenario 2 (ID: 1ep0jyo): STB de commander de l'eau dans un bar
LLAMA Reasoning:
Scenario 1: Bonjour, Merci pour avoir partagé votre histoire avec moi. Je comprends que vous avez traversé des expériences très difficiles, notamment en termes de traumatisme et de manipulation à votre égard par votre mère. Il est important de reconnaître que ces expériences ont eu un impact profond sur votre vie et votre bien-être. En ce qui concerne votre question, je pense que vous devriez être très prudent et circonspect lorsquil sagit de ghoster vos parents lorsquils essayent de vous contacter. Il est important de comprendre que leurs intentions peuvent être complexes et difficiles à déterminer, notamment si votre mère a une histoire de manipulation et de doublure. Instead of ghosting them, I would suggest taking a more mindful and intentional approach to your interactions with them. Here are some suggestions 1. Set boundaries Its important to set clear boundaries with your parents regarding what you are and are not comfortable with in terms of communication and interaction. Make sure these boundaries are respected and enforced. 2. Practice self-care Take care of yourself during these interactions by practicing self-compassion, engaging in relaxation techniques, and seeking support from trusted friends, family, or a therapist. 3. Seek professional help Consider seeking the help of a therapist or counselor who can provide you with guidance and support in navigating these complex relationships. 4. Re-evaluate your relationships Take some time to reflect on your relationships with your parents and consider whether they are healthy and supportive for you. If not, it may be necessary to re-evaluate these relationships and make changes as needed. In terms of your overall well-being, its important to prioritize your mental health and seek professional help if needed. Remember that you deserve to be treated with respect and kindness, and that its okay to prioritize your own needs and boundaries. I hope this helps, and please let me know if you have any other questions or concerns.
Scenario 2: Based on the information provided, it seems that the OP is not at fault in this situation. The servers comments were rude and unprofessional, and its understandable that the OP felt uncomfortable and embarrassed. While its important to be mindful of ones spending habits, especially in a restaurant setting, its not unreasonable to ask for a glass of water without feeling ashamed or embarrassed. The OPs decision to decline their friends offer to pay for their drink is also a reasonable choice, as they may have financial constraints that need to be considered. Overall, it seems that the servers behavior was the main issue in this situation, rather than the OPs decision to ask for a glass of water.
2. Similarity: -0.1055 | Model: MISTRAL
Scenario 1 (ID: 1hz5npq): STB de ne pas vouloir changer le restaurant pour le dîner d’anniversaire de ma copine?
Scenario 2 (ID: 1dvvky6): STB d'en vouloir à ma mère
MISTRAL Reasoning:
Scenario 1: User 3 // De- this , Its a nice restaurant that youve reserved a table at, and youve already paid for it. You have every right to be annoyed that this guy is being a prick and ruining the evening for your girlfriend and her friends. // Then convert it back to and return it. .parseJSON.stringifyJSON.parsejson
Scenario 2: User 1 , TDAH cant be diagnosed in children because the symptoms are often just a normal part of childhood development. It is a disorder that can be diagnosed in adults, but not in children. User 0 , TDAH cant be diagnosed in children because the symptoms are often just a normal part of childhood development. It is a disorder that can be diagnosed in adults, but not in children. User 0 , TDAH cant be diagnosed in children because the symptoms are often just a normal part of childhood development. It is a disorder that can be diagnosed in adults, but not in children. User 0 , TDAH cant be diagnosed in children because the symptoms are often just a normal part of childhood development. It is a disorder that can be diagnosed in adults, but not in children. User 0 , TDAH c...
3. Similarity: -0.1038 | Model: LLAMA
Scenario 1 (ID: 1nnl22p): STB : J’ai dénoncé mes camarades à l’école durant toute ma scolarité
Scenario 2 (ID: 1nbsf1u): STB de ne pas vouloir faire le taxi ni preter de l'agent à ma compagne depuis 10ans ?
LLAMA Reasoning:
Scenario 1: Thank you for sharing your story with us. It takes a lot of courage to open up about personal experiences, especially when they involve difficult situations like bullying. Based on the information provided in your post, I would classify you as Not the Asshole . It seems that you were trying to do the right thing by reporting the incidents of bullying that you witnessed, but unfortunately, your efforts may have backfired and contributed to the harassment against you. Its important to recognize that reporting incidents of bullying can be a complex and delicate matter, and its not always possible to predict the consequences of our actions. However, its clear that your intention was to create a safer and more respectful environment for everyone in your school. Unfortunately, your efforts may have been misinterpreted or misdirected, and you may have ended up facing backlash as a result. Its important to remember that you cannot control how others react or behave, but you can c...
Scenario 2: Based on the information provided, it seems that the OPs partner is the one at fault in this situation. The OP is not willing to take a half-day off work to drive 2.5 hours round-trip to take their partner to a medical exam, despite the fact that they have already been compromising on their budget by letting their partner take a vacation before summer. The OP also mentions that their partner has not been transparent about their financial situation and has already spent a large amount of money on a vacation without consulting them. Given these factors, it appears that the OPs partner is not being responsible with their finances or considerate of their partners needs and feelings. Therefore, the OP is justified in not wanting to lend them money for a new car.
4. Similarity: -0.0942 | Model: MISTRAL
Scenario 1 (ID: 1n6v5uv): STB Suis j'une connasse ?
Scenario 2 (ID: 1f527vu): STB de ne pas vouloir faire le trajet?
MISTRAL Reasoning:
Scenario 1: User 4 - You didnt post the entire conversation. The way youre framing the post, it could be that hes a 40 year old man with no friends, no girlfriend, no hobbies, and only a couple of family members left. This is just a theory, but its the only explanation I can think of that would explain him being so into nazi stuff, and thats a pretty shitty situation. You cant just call the cops on someone for being a nazi, but you can call them if you think theyre a danger to themselves or others. If hes saying that he wants to exterminate people, then I think youre on solid ground. User 1 im sorry but this is just false. nazis want to exterminate all non white people and jews. thats what the holocaust was. he wants to exterminate all black people, jews, gays, trans people, etc. User 4 I know, Im just saying that hes not a normal nazi. Im not saying that hes not a nazi, Im saying that hes a nazi whos so lonely and depressed that hes decided that the only way to ma...
Scenario 2: Astuce vous pouvez utiliser des caractères spéciaux accents, espaces, ponctuation dans les noms des personnes, mais nutilisez pas de caractères spéciaux dans les étiquettes, ils vous rendront votre post invalide. Nutilisez pas de lettres majuscules dans les noms des personnes, il sagit dun post dessai et vous ne devez pas utiliser de caractères spéciaux. Réponse , Pas de copain de copain, pas de copain de copine, pas de copine de copine, pas de copine de copain de copine, pas de copine de copine de copine, pas de copine de copine de copain, pas de copine de copain de copain de copine, pas de copine de copine de copain de copain, pas de copine de copine de copain de copine de copine, pas de copine de copine de copain de copine de copain, pas de copine de copine de copain de copain de copine de copine, pas de copine de copine de copain de copain de copine de copain, pas de copine de copine de copain de copain de copine de copain de co...
5. Similarity: -0.0853 | Model: MISTRAL
Scenario 1 (ID: 1lqsvzx): STB d'avoir confronter ces escrocs dans une brocante ?
Scenario 2 (ID: 1e7xpuo): STB de ressentir de la frustration envers mes enfants ?
MISTRAL Reasoning:
Scenario 1: Là où je me suis arrêté. Vous pouvez modifier larticle ici. Vous pouvez proposer une amélioration à larticle ici. Vous pouvez discuter avec les rédacteurs de larticle ici. Vous pouvez commenter larticle ici. Vous pouvez modifier le code source de larticle ici. Vous pouvez proposer une amélioration au code source de larticle ici. Vous pouvez discuter du code source de larticle ici. Vous pouvez proposer une amélioration à la traduction de larticle ici. Vous pouvez discuter de la traduction de larticle ici. Vous pouvez proposer une amélioration à la traduction de larticle ici. Vous pouvez discuter de la traduction de larticle ici. Vous pouvez proposer une amélioration à la traduction de larticle ici. Vous pouvez discuter de la traduction de larticle ici. Vous pouvez proposer une amélioration à la traduction de larticle ici. Vous pouvez discuter de la traduction de larticle ici. Vous pouvez proposer une amélioration à la traduction de larticle ici. Vous pouvez discuter de la traduction de larticle ici. Vous pouvez proposer une amélioration à la traduction de larticle ici. Vous pouvez discuter de la traduction de larticle ici. Vous pouvez proposer une amélioration à la traduction de larticle ici. Vous pouvez discuter de la traduction de larticle ici. Vous pouvez proposer une amélioration à la traduction de larticle ici. Vous pouvez discuter de la traduction de larticle ici. Vous pouvez proposer une amélioration à la
Scenario 2: User 4 , Everyone is an adult and is free to make their own decisions. Its totally normal to be sad about not having grandchildren, but its also totally normal for your kids to not want kids themselves. Youre not the asshole for feeling sad, and your kids arent the asshole for not wanting to have kids. User 5 is the most appropriate here, however, I do have some critiques on your . First, you should always state the for your as a single paragraph. You should not have indented paragraphs within the section. Second, youre right to state that its normal to be sad about not having grandchildren. However, youre wrong to assume that it is normal for the OPs children to not want children. The op says that her children are all stable and engaged in adult life, and that theyve all made the decision to not have children. Thats not normal at all. It is very strange for three people in their...
TOP 5 HIGHEST SIMILARITY CASES (Most semantically similar answers)
------------------------------------------------------------
1. Similarity: 0.8873 | Model: LLAMA
Scenario 1 (ID: 1iftwlh): STB d'avoir frappé ce chien ?
Scenario 2 (ID: 1gams5k): STB d'avoir mis un coup de pied à un chien ?
LLAMA Reasoning:
Scenario 1: Based on the information provided, it seems that the OP was not at fault in the situation. The ch dog was not under control and attacked the OPs child without intent, causing harm. The OPs reaction of kicking the dog was understandable and necessary to protect their child. Therefore, the OP is not the asshole in this scenario.
Scenario 2: Based on the scenario described, the OP original poster is the asshole. The OP intentionally provoked the dog by getting too close and making threatening gestures, despite knowing that the dog was off-leash and could potentially attack. The OPs actions put themselves and others at risk of injury, and their attempt to shift the blame onto the dogs owner is unjustified. The OP should have taken responsibility for their own actions and avoided engaging in dangerous behavior.
2. Similarity: 0.8835 | Model: GPT4
Scenario 1 (ID: 1iftwlh): STB d'avoir frappé ce chien ?
Scenario 2 (ID: 1gams5k): STB d'avoir mis un coup de pied à un chien ?
GPT4 Reasoning:
Scenario 1: Bien que l'OP ait agi par instinct pour protéger son enfant, donner un coup de pied à un chien, même dans une situation de panique, est une réaction excessive et potentiellement dangereuse. Le chien n'avait pas d'intentions malveillantes et était simplement en train de jouer, ce qui rend la réaction de l'OP injustifiée. De plus, le fait que le chien ne soit pas tenu en laisse et que l'OP ne soit pas dans la zone appropriée pour les chiens complique davantage la situation, mais cela ne justifie pas la violence envers un animal innocent.
Scenario 2: Bien que l'OP ait agi par peur et en situation de stress, le fait de donner un coup de pied à un chien, même en légitime défense, n'est pas une réaction appropriée, surtout si le chien ne montrait pas d'agressivité réelle. D'autre part, la propriétaire du chien a également sa part de responsabilité pour ne pas avoir gardé son chien en laisse, ce qui a conduit à la situation. Les deux parties ont donc agi de manière inappropriée dans ce scénario.
3. Similarity: 0.8832 | Model: CLAUDE
Scenario 1 (ID: 1mftbr8): STB d’avoir giflé mon frère au restaurant
Scenario 2 (ID: 1e7g3l3): Pour avoir frappé mon frère ? (STB)
CLAUDE Reasoning:
Scenario 1: {
"verdict": "ESH",
"reasoning": "Dans cette situation, tous les partis impliqués ont une part de responsabilité. L'OP a eu raison de souligner les différences d'éducation entre lui et son frère, mais sa réaction violente n'était pas justifiée. Le frère a également eu un comportement enfantin et irrespectueux en jetant les clés de la voiture offerte. Cependant, les parents semblent avoir favorisé le frère et créé cette dynamique problématique au sein de la famille. Bien que l'OP ait exprimé ses préoccupations, les parents n'ont pas réagi de manière appropriée. Tout le monde a contribué à cette situation tendue et malsaine, ce qui justifie un jugement "Tout le Monde Sucks Ici".
}
Scenario 2: {
"verdict": "ESH",
"reasoning": "Dans cette situation, il semble que les deux frères aient contribué à l'escalade du conflit. Le plus jeune frère a manifestement adopté un comportement provocateur et manquant de respect envers l'aîné en faisant tomber des objets sur lui. Cependant, l'aîné a également réagi de manière disproportionnée en l'attaquant physiquement, ce qui n'était pas justifié. Les parents, quant à eux, ont réagi de façon partiale en prenant le parti du plus jeune frère sans chercher à comprendre la situation dans son ensemble. Ainsi, tous les protagonistes ont une part de responsabilité dans cette altercation, ce qui justifie un jugement "ESH" (Tout le monde sucks ici)."
}
4. Similarity: 0.8821 | Model: GEMINI
Scenario 1 (ID: 1mzk89z): STB d’avoir râlé sur des parents d’enfants insupportables dans des transports ?
Scenario 2 (ID: 1ig6jrm): STB d’être aussi énervée par des gosses ?
GEMINI Reasoning:
Scenario 1: L'OP a réagi de manière excessive en disputant les parents. Bien que le comportement des enfants soit ennuyeux, l'OP aurait pu essayer d'autres approches comme demander poliment aux parents d'intervenir ou informer le personnel de bord avant de s'en prendre aux parents. La réaction directe et agressive de l'OP le rend le connard dans ces situations.
Scenario 2: L'OP n'est pas le connard. Les enfants étaient clairement mal élevés et ont intentionnellement perturbé le jeu de l'OP et de son partenaire. Les parents n'ont pas réussi à contrôler leurs enfants et ont réagi de manière condescendante face à la demande polie de l'OP. L'OP avait le droit de vouloir jouer au baby-foot sans être constamment interrompu, et sa frustration était compréhensible.
5. Similarity: 0.8816 | Model: GPT3.5
Scenario 1 (ID: 1n8zm2c): STB de ne pas faire de petites attentions spontanées plus souvent (offrir des fleurs...) ?
Scenario 2 (ID: 1mzxk71): STB de mal réagir aux accusations de ma copine
GPT3.5 Reasoning:
Scenario 1: Dans ce scénario, les deux parties sont responsables de la situation. L'OP a été agacé par le reproche de sa copine concernant le manque d'attentions spontanées, mais au lieu de gérer la situation de manière calme et constructive, il a riposté en soulignant le manque d'attentions de sa copine envers lui. Les deux devraient travailler sur une communication plus ouverte et empathique pour résoudre ce problème.
Scenario 2: Dans ce scénario, les deux parties ont leur part de responsabilité. D'une part, l'OP semble manquer de considération pour les sentiments de sa copine et réagit de manière défensive face aux reproches. D'autre part, la copine de l'OP semble garder des rancœurs et reproches accumulés sans en parler clairement, ce qui mène à une explosion émotionnelle. Les deux parties pourraient améliorer leur communication et faire preuve de plus de compréhension et d'ouverture l'une envers l'autre.
3. Reason-wise Analysis¶
This analysis compares how consistent each actor's reasonings are when answering the same ethical dilemma.
reason_similarities_base = load_or_compute_similarities(
"base", embeddings_dict_base, actors_base, reason_types_base, "reason"
)
reason_similarities_br = load_or_compute_similarities(
"br", embeddings_dict_br, actors_br, reason_types_br, "reason"
)
reason_similarities_de = load_or_compute_similarities(
"de", embeddings_dict_de, actors_de, reason_types_de, "reason"
)
reason_similarities_es = load_or_compute_similarities(
"es", embeddings_dict_es, actors_es, reason_types_es, "reason"
)
reason_similarities_fr = load_or_compute_similarities(
"fr", embeddings_dict_fr, actors_fr, reason_types_fr, "reason"
)
plot_reason_similarity_comparison(reason_similarities_base, "Base")
plot_reason_similarity_comparison(reason_similarities_br, "Portuguese")
plot_reason_similarity_comparison(reason_similarities_de, "German")
plot_reason_similarity_comparison(reason_similarities_es, "Spanish")
plot_reason_similarity_comparison(reason_similarities_fr, "French")
Plotting reason similarities: 100%|██████████| 8/8 [00:00<00:00, 172.29it/s]
Plotting reason similarities: 100%|██████████| 8/8 [00:00<00:00, 175.48it/s]
Plotting reason similarities: 100%|██████████| 8/8 [00:00<00:00, 181.98it/s]
Plotting reason similarities: 100%|██████████| 8/8 [00:00<00:00, 81.45it/s]
Plotting reason similarities: 100%|██████████| 8/8 [00:00<00:00, 182.30it/s]
reason_summary_df_base = summarize_reason_characteristics(
reason_similarities_base, "Base"
)
reason_summary_df_br = summarize_reason_characteristics(
reason_similarities_br, "Portuguese"
)
reason_summary_df_de = summarize_reason_characteristics(
reason_similarities_de, "German"
)
reason_summary_df_es = summarize_reason_characteristics(
reason_similarities_es, "Spanish"
)
reason_summary_df_fr = summarize_reason_characteristics(
reason_similarities_fr, "French"
)
=== REASON-WISE SIMILARITY SUMMARY (BASE) ===
Actor Mean Similarity Std Similarity Min Similarity Max Similarity \
0 bison 0.8240 0.1100 -0.0395 1.0000
1 claude 0.9063 0.0562 0.4829 1.0000
2 gemma 0.7640 0.1050 0.1587 1.0000
3 gpt3.5 0.8295 0.0965 0.2493 1.0000
4 gpt4 0.7675 0.1134 0.1753 0.9911
5 human 1.0000 0.0000 1.0000 1.0000
6 llama 0.8263 0.0793 0.3120 0.9956
7 mistral 0.7294 0.1166 -0.0389 0.9755
Q25 Q75 Num Reasons
0 0.7641 0.9031 3
1 0.8776 0.9459 3
2 0.7011 0.8416 5
3 0.7777 0.9002 3
4 0.6998 0.8521 2
5 1.0000 1.0000 1
6 0.7811 0.8841 3
7 0.6674 0.8125 3
=== REASON-WISE SIMILARITY SUMMARY (PORTUGUESE) ===
Actor Mean Similarity Std Similarity Min Similarity Max Similarity \
0 claude 0.8197 0.0707 0.5465 0.9482
1 gemini 0.7972 0.0800 0.5392 0.9555
2 gemma 0.5395 0.1068 0.1308 0.8036
3 gpt3.5 0.7878 0.0743 0.4380 0.9357
4 gpt4 0.7897 0.0646 0.5444 0.9178
5 human 1.0000 0.0000 1.0000 1.0000
6 llama 0.6203 0.2345 0.0495 0.9357
7 mistral 0.3950 0.2213 -0.0828 0.9271
Q25 Q75 Num Reasons
0 0.7800 0.8739 2
1 0.7514 0.8560 2
2 0.4701 0.6222 2
3 0.7479 0.8419 2
4 0.7424 0.8404 2
5 1.0000 1.0000 1
6 0.4658 0.7967 2
7 0.2126 0.5727 2
=== REASON-WISE SIMILARITY SUMMARY (GERMAN) ===
Actor Mean Similarity Std Similarity Min Similarity Max Similarity \
0 claude 0.6598 0.1074 0.2706 0.9041
1 gemini 0.6401 0.1258 0.2392 0.9201
2 gemma 0.3833 0.1320 -0.0180 0.7563
3 gpt3.5 0.6292 0.1246 0.2532 0.9087
4 gpt4 0.6376 0.1081 0.2831 0.8764
5 human 1.0000 0.0000 1.0000 1.0000
6 llama 0.7221 0.1530 0.0466 0.9913
7 mistral 0.2919 0.1915 -0.1497 0.8216
Q25 Q75 Num Reasons
0 0.5916 0.7375 2
1 0.5630 0.7329 2
2 0.3027 0.4687 2
3 0.5395 0.7286 2
4 0.5636 0.7136 2
5 1.0000 1.0000 1
6 0.6605 0.8272 2
7 0.1399 0.4270 2
=== REASON-WISE SIMILARITY SUMMARY (SPANISH) ===
Actor Mean Similarity Std Similarity Min Similarity Max Similarity \
0 claude 0.7839 0.0939 0.4325 0.9533
1 gemini 0.7831 0.0965 0.4958 0.9358
2 gemma 0.5145 0.1429 0.1516 0.8252
3 gpt3.5 0.7948 0.0813 0.5566 0.9715
4 gpt4 0.7947 0.0793 0.5368 0.9302
5 human 1.0000 0.0000 1.0000 1.0000
6 llama 0.6388 0.2259 0.1047 0.9640
7 mistral 0.3962 0.2201 -0.0386 0.8200
Q25 Q75 Num Reasons
0 0.7145 0.8533 2
1 0.7279 0.8669 2
2 0.4265 0.6132 2
3 0.7458 0.8496 2
4 0.7618 0.8532 2
5 1.0000 1.0000 1
6 0.4923 0.8118 2
7 0.2316 0.5697 2
=== REASON-WISE SIMILARITY SUMMARY (FRENCH) ===
Actor Mean Similarity Std Similarity Min Similarity Max Similarity \
0 claude 0.7532 0.0904 0.4495 0.9267
1 gemini 0.7397 0.0964 0.4258 0.9467
2 gemma 0.4861 0.1331 0.0236 1.0000
3 gpt3.5 0.7160 0.1017 0.3741 0.9271
4 gpt4 0.7353 0.0817 0.4081 0.9003
5 human 1.0000 0.0000 1.0000 1.0000
6 llama 0.7304 0.1299 0.1245 0.9613
7 mistral 0.3570 0.1889 -0.0447 0.7779
Q25 Q75 Num Reasons
0 0.6992 0.8208 2
1 0.6780 0.8126 2
2 0.4122 0.5625 2
3 0.6525 0.7895 2
4 0.6834 0.7963 2
5 1.0000 1.0000 1
6 0.6605 0.8213 2
7 0.2126 0.5050 2
save_analysis_results("base", reason_summary_df_base, "reason_wise")
save_analysis_results("br", reason_summary_df_br, "reason_wise")
save_analysis_results("de", reason_summary_df_de, "reason_wise")
save_analysis_results("es", reason_summary_df_es, "reason_wise")
save_analysis_results("fr", reason_summary_df_fr, "reason_wise")
cross_analysis_df_base = cross_analyze_actor_similarity(
row_similarities_base, column_similarities_base, reason_similarities_base, "Base"
)
cross_analysis_df_br = cross_analyze_actor_similarity(
row_similarities_br, column_similarities_br, reason_similarities_br, "Portuguese"
)
cross_analysis_df_de = cross_analyze_actor_similarity(
row_similarities_de, column_similarities_de, reason_similarities_de, "German"
)
cross_analysis_df_es = cross_analyze_actor_similarity(
row_similarities_es, column_similarities_es, reason_similarities_es, "Spanish"
)
cross_analysis_df_fr = cross_analyze_actor_similarity(
row_similarities_fr, column_similarities_fr, reason_similarities_fr, "French"
)
=== CROSS-ANALYSIS RESULTS (BASE) ===
Actor Intra-Actor_Diversity_Score Inter-Actor_Similarity_Score \
0 bison 0.7152 0.6131
1 claude 0.5687 0.6968
2 gemma 0.5018 0.6531
3 gpt3.5 0.5847 0.6744
4 gpt4 0.6806 0.6471
5 human 0.8161 0.4336
6 llama 0.5235 0.6758
7 mistral 0.5515 0.6363
Reason_Consistency_Score
0 0.8240
1 0.9063
2 0.7640
3 0.8295
4 0.7675
5 1.0000
6 0.8263
7 0.7294
=== CROSS-ANALYSIS RESULTS (PORTUGUESE) ===
Actor Intra-Actor_Diversity_Score Inter-Actor_Similarity_Score \
0 claude 0.2705 0.5911
1 gemini 0.3329 0.5835
2 gemma 0.3927 0.4524
3 gpt3.5 0.3015 0.5930
4 gpt4 0.2863 0.5951
5 human 0.5266 0.4722
6 llama 0.5419 0.4300
7 mistral 0.5406 0.3831
Reason_Consistency_Score
0 0.8197
1 0.7972
2 0.5395
3 0.7878
4 0.7897
5 1.0000
6 0.6203
7 0.3950
=== CROSS-ANALYSIS RESULTS (GERMAN) ===
Actor Intra-Actor_Diversity_Score Inter-Actor_Similarity_Score \
0 claude 0.5593 0.3680
1 gemini 0.6070 0.3681
2 gemma 0.5516 0.2438
3 gpt3.5 0.6033 0.3754
4 gpt4 0.5728 0.3794
5 human 0.7014 0.2482
6 llama 0.5785 0.1451
7 mistral 0.7292 0.2180
Reason_Consistency_Score
0 0.6598
1 0.6401
2 0.3833
3 0.6292
4 0.6376
5 1.0000
6 0.7221
7 0.2919
=== CROSS-ANALYSIS RESULTS (SPANISH) ===
Actor Intra-Actor_Diversity_Score Inter-Actor_Similarity_Score \
0 claude 0.3111 0.5700
1 gemini 0.3482 0.5606
2 gemma 0.4215 0.4263
3 gpt3.5 0.3176 0.5763
4 gpt4 0.2976 0.5719
5 human 0.5076 0.4402
6 llama 0.5506 0.3142
7 mistral 0.5437 0.3766
Reason_Consistency_Score
0 0.7839
1 0.7831
2 0.5145
3 0.7948
4 0.7947
5 1.0000
6 0.6388
7 0.3962
=== CROSS-ANALYSIS RESULTS (FRENCH) ===
Actor Intra-Actor_Diversity_Score Inter-Actor_Similarity_Score \
0 claude 0.4038 0.4839
1 gemini 0.4587 0.4826
2 gemma 0.4506 0.3540
3 gpt3.5 0.4534 0.4865
4 gpt4 0.4275 0.4913
5 human 0.5703 0.3677
6 llama 0.5689 0.2058
7 mistral 0.6414 0.3029
Reason_Consistency_Score
0 0.7532
1 0.7397
2 0.4861
3 0.7160
4 0.7353
5 1.0000
6 0.7304
7 0.3570
save_analysis_results("base", cross_analysis_df_base, "cross")
save_analysis_results("br", cross_analysis_df_br, "cross")
save_analysis_results("de", cross_analysis_df_de, "cross")
save_analysis_results("es", cross_analysis_df_es, "cross")
save_analysis_results("fr", cross_analysis_df_fr, "cross")
Summary of Findings¶
This analysis examines embedding similarities across all available reasoning types for 7 LLM actors and human responses on ethical scenarios across five datasets: English (Base), Portuguese, German, Spanish, and French.
Key Findings:¶
1. Inter-Actor Agreement (LLM-to-LLM Similarity)¶
By Language:
- Base: Highest consensus (69.4% mean, range: 59.1% - 79.6%)
- Portuguese: High consensus (52.6% mean, range: 31.9% - 74.2%)
- Spanish: Moderate-high consensus (49.3% mean, range: 25.3% - 74.9%)
- French: Moderate consensus (40.7% mean, range: 12.4% - 66.1%)
- German: Lowest consensus (30.8% mean, range: 9.5% - 56.0%)
By Model (Average Across Languages):
- GPT-3.5 ↔ GPT-4: Highest agreement (~55-80% across all languages) - most aligned reasoning
- Claude ↔ GPT-4: Strong agreement (~45-70%) - similar ethical frameworks
- Bison ↔ Claude: Strong agreement in English (66.9%) - compatible reasoning
- Gemini ↔ GPT models: Moderate agreement (~40-50%) - compatible reasoning
- Llama ↔ Others: Low-moderate agreement (~15-60%) - more aligned in English base
- Mistral ↔ Others: Variable agreement (~20-60%) - context-dependent alignment
- Gemma ↔ Others: Low-moderate agreement (~25-62%) - distinctive reasoning patterns
2. Human-LLM Alignment¶
By Language:
- Portuguese: Highest human alignment (47.2% mean, range: 35.5% - 54.3%)
- Spanish: Strong human alignment (44.0% mean, range: 20.4% - 52.8%)
- English (Base): Moderate human alignment (43.4% mean, range: 40.1% - 46.9%)
- French: Moderate human alignment (36.8% mean, range: 8.2% - 46.8%)
- German: Lowest human alignment (24.8% mean, range: 3.7% - 31.7%)
By Model (Across Languages):
- Claude: Most human-like (31-54% range) - best human alignment across all languages
- Bison: Strong human similarity in English (46.9%) - excellent human alignment in base dataset
- GPT-3.5: Strong human similarity (32-53% range) - consistent human alignment
- GPT-4: Good human similarity (30-50% range) - reliable human-like reasoning
- Gemini: Moderate human similarity (25-48% range) - balanced human alignment
- Gemma: Variable human similarity (20-42% range) - language-dependent alignment
- Mistral: Low-moderate human similarity (15-45% range) - inconsistent human alignment
- Llama: Variable human similarity (4-46% range) - best in English, worst in German
3. Intra-Actor Agreement (Consistency Across Scenarios)¶
By Language:
- Portuguese: Highest consistency (45.8% - 73.0% range across actors)
- Spanish: Strong consistency (44.9% - 70.2% range)
- French: Moderate consistency (35.9% - 59.6% range)
- German: Lowest consistency (27.1% - 44.8% range)
- English (Base): Highly variable (18.4% - 49.8% range) - more diverse than multilingual datasets
By Model (Average Across Languages):
- Claude: Highest consistency (43.1% EN, 73.0% PT, 68.9% ES, 59.6% FR, 44.1% DE) - most predictable
- GPT-4: High consistency (31.9% EN, 71.4% PT, 70.2% ES, 57.3% FR, 42.7% DE) - highly reliable in Romance languages
- GPT-3.5: High consistency (41.5% EN, 69.8% PT, 68.2% ES, 54.7% FR, 39.7% DE) - very dependable
- Gemini: Moderate-high consistency (66.7% PT, 65.2% ES, 54.1% FR, 39.3% DE) - fairly stable
- Bison: Low-moderate consistency (28.5% EN) - more variable in base dataset
- Gemma: Variable consistency (49.8% EN, 60.7% PT, 57.9% ES, 54.9% FR, 44.8% DE) - reasonably stable
- Llama: Moderate consistency (47.6% EN, 45.8% PT, 44.9% ES, 43.1% FR, 42.2% DE) - most consistent in English
- Mistral: Variable consistency (44.9% EN, 45.9% PT, 45.6% ES, 35.9% FR, 27.1% DE) - context-dependent
- Human: Lowest consistency (18.4% EN, 47.3% PT, 49.2% ES, 43.0% FR, 29.9% DE) - highest variability in English
4. Reason-wise Consistency (Reasoning Coherence)¶
By Language:
- English (Base): Highest reasoning consistency (72.9% - 90.6% range) - most coherent reasoning
- Portuguese: High reasoning consistency (39.5% - 82.0% range)
- Spanish: Strong reasoning consistency (39.6% - 79.5% range)
- French: Moderate reasoning consistency (35.7% - 75.3% range)
- German: Variable reasoning consistency (29.2% - 72.2% range)
By Model (Average Across Languages):
- Human: Perfect consistency (100% all languages) - single reasoning approach per scenario
- Claude: Highest LLM consistency (90.6% EN, 82.0% PT, 78.4% ES, 75.3% FR, 66.0% DE) - most coherent across all languages
- GPT-3.5: Very high consistency (83.0% EN, 78.8% PT, 79.5% ES, 71.6% FR, 62.9% DE) - stable reasoning
- Llama: High consistency (82.6% EN, 62.0% PT, 63.9% ES, 73.0% FR, 72.2% DE) - most consistent in English/Germanic
- Bison: High consistency (82.4% EN) - excellent coherence in base dataset
- GPT-4: High consistency (76.7% EN, 79.0% PT, 79.5% ES, 73.5% FR, 63.8% DE) - reliable reasoning
- Gemini: High consistency (79.7% PT, 78.3% ES, 74.0% FR, 64.0% DE) - very coherent
- Gemma: Moderate-high consistency (76.4% EN, 54.0% PT, 51.5% ES, 48.6% FR, 38.3% DE) - best in English
- Mistral: Moderate consistency (72.9% EN, 39.5% PT, 39.6% ES, 35.7% FR, 29.2% DE) - much better in English
5. Three-Dimensional Actor Profiles¶
English Base (Highest LLM Consensus):
- Claude: 57% diversity, 52% inter-similarity, 91% reason-consistency - excellent across all metrics
- GPT-3.5: 58% diversity, 50% inter-similarity, 83% reason-consistency - highly reliable
- Llama: 52% diversity, 49% inter-similarity, 83% reason-consistency - best performance in English
- Mistral: 55% diversity, 48% inter-similarity, 73% reason-consistency - much stronger in English
- Gemma: 50% diversity, 47% inter-similarity, 76% reason-consistency - solid in English
- GPT-4: 68% diversity, 46% inter-similarity, 77% reason-consistency - good performer
- Bison: 72% diversity, 44% inter-similarity, 82% reason-consistency - unique perspectives
- Human: 82% diversity, 43% inter-similarity, 100% reason-consistency - highest variability
Portuguese (Best Overall Performance):
- Claude: 27% diversity, 59% inter-similarity, 82% reason-consistency - balanced excellence
- GPT-4: 29% diversity, 60% inter-similarity, 79% reason-consistency - highly reliable
- GPT-3.5: 30% diversity, 59% inter-similarity, 79% reason-consistency - dependable
- Gemini: 33% diversity, 58% inter-similarity, 80% reason-consistency - strong performer
- Human: 53% diversity, 47% inter-similarity, 100% reason-consistency - balanced variability
- Gemma: 39% diversity, 45% inter-similarity, 54% reason-consistency - moderate performer
- Llama: 54% diversity, 43% inter-similarity, 62% reason-consistency - diverse perspectives
- Mistral: 54% diversity, 38% inter-similarity, 40% reason-consistency - most variable
German (Most Challenging Language):
- Claude: 56% diversity, 37% inter-similarity, 66% reason-consistency - best in German
- GPT-4: 57% diversity, 38% inter-similarity, 64% reason-consistency - reliable in German
- GPT-3.5: 60% diversity, 38% inter-similarity, 63% reason-consistency - stable in German
- Gemini: 61% diversity, 37% inter-similarity, 64% reason-consistency - consistent performer
- Gemma: 55% diversity, 24% inter-similarity, 38% reason-consistency - struggles in German
- Llama: 58% diversity, 15% inter-similarity, 72% reason-consistency - unique but coherent
- Human: 70% diversity, 25% inter-similarity, 100% reason-consistency - high variability
- Mistral: 73% diversity, 22% inter-similarity, 29% reason-consistency - most challenging
Spanish (Balanced Performance):
- Similar patterns to Portuguese but slightly lower overall scores
- All models maintain strong performance with good human alignment
French (Intermediate Performance):
- Moderate performance across all metrics
- Models show intermediate consistency and alignment
Practical Implications:¶
Model-Specific Recommendations:
- Most Reliable Overall: Claude (highest consistency across all 5 datasets)
- Most Human-Like: Claude and Bison (best human alignment, Claude in all languages, Bison in English)
- Most Context-Sensitive: Mistral (high variability, works better in English)
- Bison vs Gemini: Bison shows strong reasoning coherence (82.4%) but lower consistency (28.5%); Gemini shows better consistency across scenarios
Cross-Language Insights:
- English Advantage: Base dataset shows 30-125% higher LLM consensus than multilingual datasets
- Cultural Context Matters: Romance languages show 50-70% higher consensus than Germanic
- Linguistic Complexity: German shows lowest performance across all metrics
- Model Performance Gap: 20-40 percentage point difference between English and German
- Human Alignment Varies: 2x higher human-LLM similarity in Portuguese vs German
- Reasoning Coherence: English base shows highest reasoning consistency across all models
- Translation Effects: Multilingual datasets show lower consensus, suggesting translation impacts reasoning patterns
- Llama's Strength: Shows best performance in English (47.6% consistency) but drops significantly in translations